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(54) Title: MUTATED CLASS I MAJOR HISTOCOMPATIBILrrY PROTEINS AND COMPLEXES 

(57) Abstract: Provided herein are combinatorial containing chimeric Major Histocompatibility Comples (MHC) Qass I proteins 
displayed on the surfaces of recombinant yeast cells. Members of the libraries, especially where those libraries have been mutage- 
nized eith^ with eiror-prone Polymerase Cahin Reaction or vnth site-directed oligonucleotide mutagenis, are improved in confor- 
mation stability or in binding to a taxget, e.g., a peptide or other ligands as compared with the stability or binding affinity of a wild 
type MHC Class I chimeric protein. The improved mutant chimeric proteins can be selected by various means, including fluores- 
cence activated cell storing with a fluorescent ligand bound to the surfaces of the yeast cells displaying the improved mutant chimeric 
pro^n. 
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MUTATED CLASS I MAJOR HISTOCOMPATIBILnT PROTEINS AND COMPLEXES 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application is a Continuatioii-in-Part of United States application No. 60/254,495, 
filed December 8, 2000. 

ACKNOWLEDGMENT OF FEDERAL RESEARCH SUPPORT 
This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant No. PHS 5 ROl AB5990). Accordingly, the United Slates Government has 
certain rights in this invention. 

BACKGROUND OF THE INVENTION 
The field of the present invention is molecular biology, in particular, as it is related to 
combinatorial libraries of immune cell proteins displayed on tbe cell surface of a recombinant 
host cell. More specifically, the present invention relates to a library of major 
histocompatibility locus proteins displayed on the surfaces of recombinant yeast cells, to 
mutant MHC Class I and proteins selected for ixapioved binding to particular target peptides, 
to mutant MHC Class Iproteins selected for binding to a particular antigen, to MHC Class I 
proteins of inqiroved stability and to the use of the selected high afBnity and/or more stable 
MHC derivatives m diagnostic methods and imaging assays, among other applications 
includiog prophylactic and tiierapeutic treatments. 

Proteins encoded by the major histocompatibility complex (called MHC proteins) are 
requisite conq>onents of the antigenic con^lexes that are involved in many diseases. These 
diseases include cases where the body reacts with one's own MHC proteins (in various 
autoimmune diseases) or infectious diseases and cancer, where the MHC are critical in binding 
and presenting foreign, antigenic pq>tides. In this invention, MHC proteins of the class I type 
were expressed as heterologous. surfece-Iinfced fusions on yeast cells with the goal of 
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generating improved MHC proteins. Libraries of mutant MHC and mutant peptide-MHC 
complexes could be screened for higher surface levels in order to identify variants that 
exhibited improved properties, including enhanced stability. For the first time, this system j 
allows the directed evolution of MHC molecules that represent novel agents for various ; 
S diagnostic and therapeutic applications. These agents could be used in cancer, infectious 
diseases (e.g., virus infections), and autoimmune diseases (e.g., multiple sclerosis, type I 
diabetes, rheumatoid arthritis). 

T cell receptors (TCRs) and antibodies have evolved to recognize different classes of 
10 ligands. Antibodies function as membrane-bound and soluble proteins that bind to soluble 
antigens, whereas in nature, TCRs function only as membrane-boitod molecules that bind to 
cell-associated peptide/MHC antigens. All of the energy of the antibodyiantigen interaction 
focuses on the foreign antigen, whereas a substantial fraction of the energy of the 
TCR:peptide/MHC interaction seems to be directed at the self-MHC molecule [Manning et al. 
IS (199S) Immmity 8:413:425]. Inaddition, antibodies can have ligand-bindingafBnities that are 

I 

orders of magnitude higher than those of TCRs, largely because of the processes of sonoatic 
mutation and affinity maturation. In their normal cellular context, TCRs do not tmdergo 
somatic mutation, and the processes of thymic selection seem to operate by maintaining a { 
narrow window of afGnities [Alam et al. (1996) Nature 381:616-620]. The association of I 
20 TCRs at the cell surface with the accessory molecules CD4 or CDS also may influence the I 
functional affinity of TCRs [Garcia et al. (1996) Nature 384:577-581]. Despite these 
differences, the three-<iimensional structures of the two proteins are remarkably similar, with 
the hypervariable regions forming loops on a single face of the molecule that contacts the 
antigen. 

25 

There is a long felt need m the art for Class I MHC protems and Class I MHC/peptide 
complexes with inq)roved stabili^ and/or with improved T cell regulatory properties. Such 
hnproved Class I MHC proteins or con^lexes are useful in activating T cells that participate 
in the removal of target cells mchiding neoplastic cells and cells infected with pa&ogenic 
30 agents including, but not limited to, viruses, protozoans, bacteria, fungi or nematodes. The 
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improved Class I MHC proteios and complexes of the present invention are also improved for 
use as research tools. 

SUMMARY OF THE INVENTION 

S The present invention provides combinatorial libraries of Clas& I MHC proteins 

displayed on the surfaces of recombinant host cells, for example, yeast cells, desirably, 
Saccharomyces cerevisiae. From such a library can be isolated mutant MHC proteins that 
exhibit a relatively high affinity for a peptide ligand of mterest. Also within the scope of the 
present invention are methods for isolating mutant Class I MHC proteins with io^roved 

10 stability, especially as MHC/peptide complexes of inq)roved stability. 

The present invention further provides Class I MHC/peptide complexes and proteins 
that exhibit mcreased stability over the wild type Class I MHC/peptide complex or MHC 
protem, and which MHC proteins exhibit high affinity for a peptide ligand interest. This 
15 ligand can be a peptide, a protein, a carbohydrate moiety, or a lipid moiety, among others. 

Suitable labels allowmg for detection of a ligand bound to an MHC protem, directly or 
indirectly, ioctudebut are not limited to fluorescent compounds, chemihiminescent compounds, 
radioisotopes, chromophores, and others. The stable, MHC protem of the present mvention, 
20 where it specijfically binds to a tumor cell antigen with high affinity and specificity can be used 
m diagnostic tests to detect T cells that are specific for the Class I MHC/peptide complex. The 
Class I MHC/peptide complexes of the present invention can also be used to activate T cells 
and thus, to enhance an immune response to an antigen or target cell of interest. 

25 Also provided by the invention are novel yeast display vectors and surface expression 

constructs in which the portion of the fusion protem that mediates attachment to the cell surface 
(flie AGA2 sequence) is located downstream, or at flie carboxy end of the protein/sequence of 
mterest. See Fig. 2. 

30 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides a ribbon diagram corresponding to the crystal structure of a class I 
inajor histocompatibility complex widihigblightedsubunits. The MHC is trimeric. TheMHC 
K** a chain ("350 amino acids) binds to an 8-10 amino acid residue peptide in the peptide 
S binding cleft. The p2 microglobulin (99 amino acids) associates non-covalently with the a 
chain. 

Figure 2 illustrates the displayed MHC protein of interest displayed on the yeast cell 
surface via a disulfide linkage through the AGA2 portion of the fusion protein comprising the 
10 MHC component 

Figure 3 illustrates diagramnoatically die pCT302 yeast surface display vector which 
contains a sequence encoding AGA2/HA-Class I MHC - c-myc fusion protein. This fosion 
protein coding sequence is expressed in yeast under the regulatory control of the GALl-10 
15 promoter. A similar vector, pYDl, is commercially available from Invitrbgen. 

Figure 4 provides diagrams of various single-chain Class I MHC constructs cloned into 
tiie pCT302 Yeast Display Vector, 

20 Figure 5 provides the results of flow cytometric analyses of various K*" constructions. 

Yeast ceUs displaying K?/p2m, SIYR K*'/p2m, dEV8 K%2m, and OVA KVp2m were stained 
with and-c-myc Mab 9E10 and biotinylated anti-K*' antibody (B. 8.24.3) followed by FTTC 
labeled F(ab*)2 goat anti-mouse IgG or SA-PE (shaded peaks). For a background control, 
yeastweretteatedwithonlythe secondary stain (unshaded peak). Labeled yeast were analyzed 

25 on Coulter Epics XL flow cytometer. All four constructions displayed the properly folded 
molecule on the surface. 

Figure 6A-6D illustrate histograms from CD69 up-regulation assays. Induction of T 
cell activation marker CD69 on naive splenocytes (10^ in the presence of KVp2m, anti-TCR 
30 scFv, and SIYR/K*/p2m yeast cells. Induced yeast cells (10^ were incubated at 37''C and 5% 
CO^ with splenocytes from 2C TCR/RAG-1"'* mice. After 20 hours in culture, cells were 

4 
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harvested, washed in PBS/0.5% BSA and stained with biotiiqriated clonotypic antibody. 1B2 
(Kranz at al. (1984) Proc. Nati, Acad. Sci. USA 81: 7922-7926] for 45 minutes. These yeast were 
washed and stained with a mixture of the early activation marker FTTC anti-CI>69 antibody and 
SA:PE. The yeast/T ceU mixture was analyzed for bound FTTC-labeled anti-CD69 antibody 
5 by flow cytometiy, gating on 1B2 positive T cells. The mean fluorescence units for HTCanti- 
CD69 antibody in the absence of yeast, presence of K''/b2m yeast, anti-TCR scFv yeast, and 
SIYR/K!'/p2m yeast are indicated and ilhistrated m flie histograms. Yeast cells that e3q)ressed 
E* with the agonist peptide (called "SIYR"; SIYRYYGL SEQ ID NO:l), but not yeast cells 
wifli K" and no peptide, were capable up-regulatmg the CD69 molecule on the 2C T cells. 
10 This up-regulation was even greater than observed with the positive control aitti-TCR antibody 
KJ16 [Cho et al. (1998) /. Immunol. Methods 220(1-2): 179-188]. • 

Figure 7 shows the results of dkect activation of T cells by yeast that express Class I 
Peptide/MHC, as measured by flow cytometry. Up-regulation on2C TCR/RAG^^ splenocytes 
15 incubated with varying ratios of yeast (3 to 100 x 10* yeast and 10" T cells) were analyzed by 
flow (grtometry. The mean fluorescence units of FTTC anti-CD69 antibody was detected on 
1B2 gated T cells follovMng 20 hours of incubation (37'C, 5% COj) of 2C TGR/RAG^^^ 
splenocytes with yeast that ejqpressed: SIYR/Bd'/p2m , anti-TCR scFv ^sitive control), and 
K?>/p2m (negative control). 

20 

Figure 8 shows direct activation of T cells by pMHC on yeast, dms providing evidence 
forpMHC specificity excess pq)tide and anti-K^ antibody inhibit CD69 up-regulation. In order 
to confirm the specificity of T cell activation, K''/p2m (negative control), SIYR/K?/p2m 
beari^g yeast cells, and 2C TCR/RAG"'' splenocytes were incubated wifli excess OVA 

25 (iSIINFEBa., SEQIDNO:2)peptideoranti-B?antflx)dy (50/tg/mlB.8.24.3)at37°C, 5%C02. 
Excess OVA peptide binds to K!' and should conqpete for binding to OjB molecule, but it is 
not recognized by 2C T cells. Anti-K* antibocfy recognizes the al/a2 helices and should 
prevent bmding by the T cell receptor from 2C T cells. After 1 hour, 20 TCR/RAG^'- 
splenocytes were mixed with die yeast and incubated for 20 hrs. Detection of CD69 on 2C T 

30 cells following incubation with yeast in the absence or presence of inhibitors was detected with 
FTTC anti-CD69 antibody and reported as flie mean fluorescence units. Bofli excess OVA 
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peptide and the anti-K^ antibody inhibited the up-regulation of CD69, confirming that the 
recognition of the yeast-bound SIYR/K*' was specific. 

Figure 9 diagrammatically illustrates cloning and transformation via homologous 
5 recombination. Linear mutagenized MHC protem coding sequence and linearized pCT302 
vector are co-electroporated into yeast. This strategy is used in the preparation of the yeast 
display library for the MHC Class I protem. 

Figure 10 illustrates the strategy for mutagenizing a yeast display library and screening 
10 that library by taking advantage of the bindmg properties of mutant vs wild type displayed 
proteins. Fluorescently labeled ligands or antibodies which bind -the displayed protein are 
employed in flow cytometry assays. 

Figure 11 provides exan[q)les of sortmg libraries generated by random mutagenesis. 

15 A SIYR/K!'/p2m error prone library (SEP) and a dEV8/KVp2m error prone Ubrary (dEP) were 
each sorted 3 times. The SEP and dEP yeast (10^), after having been induced for 2 days, were 
washed with PBS/0.5% BSA. stained with biotinylated anti-K^ (B.8.24.3) for 60 min, washed 
agam, and stained with SA:PE for 30 min. The first sort isolated the top 1.0% of the 
population usmg a 1:25 dilution of anti-K^. The second and third sorts were stained with a 

20 1:250 and 1:500 dilution of anti-K*' for SEP and 1:250 dilution for both the second and third 
sort of dEP. The second and third sorts isolated the top 0.25 % and 0. 1 % of the population, 
respectively. Representative histograms from the second and third sort of the SEP library 
iUustrate tiie mean fluorescence shift of the total population or enhancement of tiie more 
fluorescent yeast cells from sort 2 to sort 3. This indicated tiiat there were likely to be K** 

25 mutants that exhibited mcreased stabiUty and hence, increased surface levels [Shusta, (1999) 
J. Mol Biol. 292: 949-956]. 

Figure 12 shows the 1^ surface levels of mutant clones isolated by sorting from 
dEV8/K^ error prone PGR library. Following sorting, ten randomly selected clones from tiie 
30 dEV8/K*'/p2m library were analyzed on the flow cytometer for binding to the biotmylated anti- 
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K** antibody (B.8.24.3). Binding was detected witii SA:PE, and mean fluorescence for each 
clone was determined. 

Figure 13 shows sequences of mutant clones isolated by sorting from dEV8/K** error 
5 prone PGR Ubrary . The mutations in three clones selected from the dEV8/K*/p2m error prone 
library (sort 3) and the crystal structure with two of the mutations (W167R,Y63N) are shown. 

Figure 14 illustrates construction of a directed, mutagenic peptide library m dEV8/K** 
library construction. Mutagenic PGR of positions 1-3 of the peptide dEV8 was performed 

10 using tiie degenerate upstream primer (5' ATA TTT TGT GTT ATT OCT TGA GTT TTA 
OCA OCT AGC TTG GAT AAA AGA NNS NNS NNS AAA TTC 3 ' , SEQ ID N0:3) and 
a downstream vector-specific primer. Using homologous recombination, the mutagenic 
dEV8/KVp2m PGR product and Nhe I-Nde 1 digested dEV8/KVp2m pCT302 were 
electroporated mto electrocompetent yeast (BY5465, strain EBYIOO). A dilution of tiie 

15 transformed library was plated on SD-CAA plates and uicubated for 3 days at 30°C to obtain 
the Ubrary size (7 x itf). 

Figure 15 shows the properly folded a3 domam, but the al/a2 domams of L*^ are not 
displayed on yeast. L*^/p2m yeast cells were pulsed for two hours with the L'^-binding peptide 

20 QL9 (QLSPFPFDL, SEQ ID NO:4), washed wifli PBS/0.5 %BSA, and stained with anti-L^ 
antibodies, 28. 14.8 (a3 specific) or 30-5-7 (al/a2 specific). After 45 min, cells were washed 
and stained with FITC-labeled F(ab=)2 goat anti-mouse IgG, and binding was detected using 
flow cytometry. The histograms generated from the L''/p2m displaymg yeast represent the 
bindmg of the a3 specific antibody (28.14.8). The 30-5-7 antibody, al/a2 specific, did not 

25 show the same binding (histograms not included), suggesting that at least one of tiiese two 
domains was not folded properly and thus might be stabilized by a process of directed 
evolution and yeast display. 

Figure 16 illustrates the result of sorting a yeast display mutant library of "unstable** 
30 Class I MHO L**. Yeast cells displaymg LVp2m were induced at 20 C for 2 days in the 
presence of an L**/p2m specific peptide, QLSPFPFDL (QL9, SEQ ID N0:4). LVp2m yeast 
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were stained with supematants from D. melanogaster cells expressing the high-affinity T cell 
receptor (m6), biotinylated anti-T cell receptor antibody (F23.1) and a streptavidin- 
phycoerythrin (SA:PE) conjugate (shaded peak). The lul^lm yeast cells were also stained 
with an a2 domain specific antibody (30-5-7) and an a3 domain specific antibody (28.14.8) 
followed by a FITC labeled F(ab')2 goat anti-mouse IgG (shaded peak). For a backgrouiid 
control, yeast cells were treated with only tibe secondary stains (unshaded peaks). Labeled 
yeast cells were detected using a Coulter EPICS XL flow cytometer. The L^'/pim a3 domain 
was folded properly on the sur£ace, but not the a2 domain or the TCR binding domain. 

Figure 17 demonstrates that the L*' mutant 30.8 exhibits a properly folded TCR binding 
domain and properly folded a2/a3 domains. An L^/p2m error prone library was created using, 
homologous recombination and electroporation into electrocompetent 5. cerevisiae EBYIOO 
cells. The L'^/p2m error prone yeast library was sorted with L*^/pb2m a3 specific antibody, 
28.14.8, and an L^/p2m a2 specific antibody, 30-5-7. Sorted yeast cells were screened by 
flow cytometry for bmding to 30-5-7 antibody and high afBnity T cell receptor supernatant 
(m6), as described in Fig. 16 above. Unlike yeast cells that e3q>ress the wild type L''/p2m, flie 
L*'p2m mutant, yeast cells expressing 30.8 were detected by the a2 specific antibody (30-5-7) 
and by the high afiBnity T cell receptor (m6) when the correct pq)tide (QL9) was added 
exogenously. Mutant 30.8 was also detected by the a3 specific antibody, 28.14.8. 

Figure 18 summarizes the results for a yeast display library of peptides that bind to 
MHC class I K^. Known peptides (SIYRYYGL, SEQ ID N0:1; EQYKFYSV, SEQ ID 
N0:5; SHNFEKL, SEQ ID NO:2) have anchor residues at peptide positions 5 and 8. P5 
requires an aromatic amino acid whereas P8 requires a hydrophobic amino acid. Yeast 
displaymg a peptide binding motif, but with the AGA2 fused at flie amino terminus, were 
isolated after incubation of yeast cells with the dEV8 (EQYKFYSV)/K'* tetramer . 

Figure 19 shows that yeast cells displaying the E? peptide motif were detected by flow 
cytometry usmg a fluorescent-labeled tetramer. Yeast cells with the C-tenmnal sequence 
shown m the top panel were stained with dEV80BQYKinrSV)/K'' tetramer^ The 
dEV8 peptide dissociates fix)m the F? pocket, and the AGA2-fused peptide on the yeast surface 
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binds to the free K^. Binding of the peptide motif to the tetramer was detected by flow 
cytometry (shaded peak). For a background control, yeast cells were treated with streptavidin- 
phycoerythrin (SA:PE) conjugate alone. Mutagenic substitution of the proline residue at 
positionP4 with an alanine residue reduced binding significantly (bottom panel), indicating that 
5 the prolione is important in K**-bindmg. 

Figure 20 shows that novel class I MHC binding peptides are isolated by yeast diq>lay 
technology. Using the lead peptides identified by yeast display e35)ermients (Fig. 19), one can 
now identify improved peptides by production of libraries that contain mutations in non-anchor 
10 residues (e.g. P6, P7, and P9). 

DETAILED DESCRIPTION OF THE INVENTION 
The role of proteins encoded by the major histocompatibiUty complex (called MHC 
proteins) has now been known for almost twenty years. MHC proteins are expressed by every 

15 individual and function as "antigenrpresenting" molecules. That is, each MHC protein can 
bind to a variety of different small peptides (8 to 20 amino acids in length) that are derived 
from proteins present mside a cell. MHC proteins present both self-peptides (i.e-, derived 
from an individual's own endogenous protems) and foreign peptides (i.e., derived from a 
foreign agent such as a vuiis) , Once a peptide is bound to the MHC protein, the entire peptide- 

20 MHC complex is expressed on the surface of the cell. If the peptide is foreign, a T lymphocyte 
(T cell) can potentially recognize the complex, and the resultant interaction of the T cell 
receptor (TCR) and the pMHC can result in T cell activation. T cell activation can lead to 
recruitment of other immune cells and a corresponding inflammatory reaction. Such 
inflammatory reactions are beneficial if the pMHC target antigen is, in fact, derived from an 

25 mfectious agent or from a transformed cell (i.e., cancer). However, such mflammatory 
reactions can be very detrimental if the pMHC target antigen is derived from self tissue, as the 
reactions can lead to severe autoimmune disease, where an individual's inmnme system attacks 
normal tissue. Such is the case when a patient's lymphocytes attack the islet cells of the 
pancreas (type I diabetes), tiie nervous systrai (multiple sclerosis), or jomt-<lerived components 

30 (iteumatoid arthritis). 
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The central role of pMHC complexes in these phenomena has been established by 
thousands of published studies that include genetic linkages of diseases to the human MHC 
locus (HLA). It has now also been established that it is possible to use appropriately 
characterized peptide-MHC molecules as either agonists of an immune response (e.g., in 

S cancer and infectious diseases) or as antagonists of responses (e.g. , in autoimmune responses). 
While several approaches have been taken to produce such pMHC complexes in soluble forms 
for these purposes and for biochemical/structural studies » it has not been possible to use current 
methods of in vitro directed evolution to improve the stabili^ or antigenicity of the pMHC 
complex. This is because the pMHC complex is normally a membrane-associated complex 

10 composed of mult^le different subunits (heavy chain, beta-2-microglobulin, and peptide in the 
case of a class I MHC and a-chain, P-chain, and peptide in the case of class n MHC) and such 
proteins are typically not amenable to the ciurent methods of directed evolution (primarily 
phage display). The present invention shows that a display system for directed evolution can 
be used to express properly folded Class I MHC proteins on the surface of yeast. The 

15 displayed pq)tide-MHC conq)lexes can be used to directly activate T cells, in ord^ to 
identify/screen for pMHC antigens. In addition, mutated libraries of the pMHC proteins could 
be created and used for selection by flow sorting of stabilized pMHC variants. The stabilized 
variants can be identified because they were expressed at higher levels on the yeast sur£ace and 

can therefore be easily identified by using a fluorescent-labeled probe for the pMHC construct, 

.* 

20 combmed with higfh-throughput flow cytometric sorting or such cells. 

The Class I MHC proteins (see Fig . 1 for ribbon diagram) are composed of an a chain 
of 350 amino acids and a P2m chain of 99 amino acids; the two chains are noncovalentiy 
associated with one another. Peptides that are about 8-10 residues bind the Class I MHC 
25 molecules. About 10^ to 10^ peptide-Class I MHC conq)lexes are displayed on the surfaces of 
nucleated cells. Class I MHC conoplexes are involved in the recognition of virus infected cells, 
pathogen-infected cells, and tumor antigens by cytotoxic T lymphocytes. 

To date, no Class I MHC protein has been crystallized without a peptide bound in the 
30 peptide binding cleft. Some Class I MHC proteins are difficult or inq)0ssible to produce in 
soluble form because of their instabilities. For exanq^le, the mouse Class I MHC is 

10 
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relatively stable, and it has been used in many studies. By contrast, the mouse Class I MHC 
L** is less stable, and it has been more difficult to produce. The stability of a particular 
peptide-Class I MHC complex is direcfly related to its ability to stimulate efficient T cell 
responses, for example, in vaccine applications. A system for the in vitro evolution of more 
5 stable peptide-MHC complexes allows for novel agents and vaccines to be used in stimulating 
protective immune responses against diseases and i^plastic conditions. 

The present invention allows the creation and isolation of stabilized variants of peptide- 
MHC complexes. Toward tiiis end, we have displayed single-chain peptide/Class I MHC (a 

10 chain/p2m) complexes on the surface of yeast cells. Yeast cells .e3q[)ressing a specific 
peptide/MHC complex (SIYR/K**) on the cell surface were capable of directiy activating T 
cells, thus providing a method for screemng powerful T cell agonists. Mutant libraries of a 
less stable peptide/MHC conq)lex (dEV8/K^ were made and expressed, and mutant MHC 
polypeptides of increased stability were isolated. Shnilarly, the less stable MHC L** was 

15 eiqpressed as a mutant library displayed on the sur&ce of recombinant yeast cells, and more 
stable variants were isolated using flow cytometry screening methodology. WO 99/36569, 
incorporated by reference herein, provides abundant discussion of this display technology. 

Fig. 2 illustrates the MHC protein of mterest displayed on the yeast cell surface via a 
20 disulfide linkage through the AGA2 portion of the fusion protein conq)rising the MHC 
conq)onent. AGA2 is a mating adhesion receptor which is naturally bound to the cell surface 
in disulfide linkage to the AGAl protein. The HA and the c-myc portions of the displayed 
fusion protein serve as epitope tags and can be used in normalizing the fluorescent peptide 
bmding data. Each recombinant yeast cell displays about 50,000 copies of the surface-bound 
25 fusion protein (if stable) on its surface. A fluorescent antibody or peptide ligand is added, and 
the cells are sorted using flow cytometry. Those MHC fusion proteins of increased stability 
exhibit stronger binding of the fluorescent ligand, and tiiese cells are selected during tiie cell 
sorting procedure. 

30 Fig. 3 illustrates diagrammatically the pCT302 yeast sur£ace display vector which 

contains a sequence encoding AGA2/HA-Class I MHC - c-myc fusion protem. This fusion 
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protein coding sequence is expressed in yeast under the regulatory control of the GALl-10 
promoter. A similar vector, pYDl , is commercially available from Invitrogen (Carlsbad, CA). 

Fig. 4 shows various constructs for the e?q)ression of fusion proteins containing 
portions representing the peptide binding portions of various Class I MHC proteins. 
Surprisingly, the AGA2 portion was functional either at the N-terminus or the C-terminus of 
the fusion protein, and it mediated binding to the yeast cell wall surface when associated with 
any of the protein portions tested and in both the anuno- or carboxyl ternMnal positions. In 
Fig. 4 SS refers to a signal peptide sequence necessary for proper nxtracellular transport of the 
fusion protein. The signal peptide is cleaved prior to display on the cell surface. The MHC 
polypeptide coding portion includes a sequence encoding a 15 amine acid spacer between the 
COOH terminus of the a chain and the NHj end of the p2m portion. HA refers to the peptide 
tag derived in sequence from hemagglutinin, which tag is located at the amino terminus of the 
a chain. C-myc refers to the peptide tag at the COOH tennmus of the b2m portion. The three 
peptides linked at the amino terminus of represent a stroi^ agonist ("SIYR" peptide, SEQ 
ID N0:1), a weak agonist (dEV8) and a nuU peptide (OVA) for the T cell clone called 2C, 
The AGA2 coding sequence was cloned at the COOH termini of these constructs in order to 
allow free NHj termmi of the peptides, which is generally thought to be important for proper 
binding to Class I MHC polypeptides. All fusion protein coding sequences were assembled 
using standard polymerase chain reaction (PCR) strategies. 

The correct foldings of various fusion display proteins were confirmed by fluorescence 
activated cell sorting after binding of the recombinant yeast cells to fluorescently labeled 
antibody specific for c-myc or K^. See Fig. 5. 

To demonstrate the direct activation of T cells by recombinant yeast cells expressmg 
a peptide/MHC complex, naive splenic T ceUs from a TCR transgenic mouse (2C TCR tg) 
were mcubated with yeast cells expressing SIYR/K\ K?* (negative control) or anti-TCR scFv 
(positive control). After about 20 hours in culture, the T cells were analyzed by flow 
cytometry for die up-regulation of the activation marker CD69. See Figs. 6A-6D for a 
graphical display of the results. 
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Fig. 7 shows the dependence of T cell activation on T ceUs and on the presence of a 
peptide/MHC complex. Fig. 8 demonstrates the pMHC specificity: excess free peptide and 
anti-K*' antibody inhibit CD69 up-regulation in T cells. 

5 We have demonstrated direct activation of T cells by yeast cells that express 

peptide/MHC. Splenocytes (10^) from 2C TCR/RAG-l^" mice were combined wifli 
approximately 3-100 x 10^ of KF washed yeast cells displaying KVp2m, SIYR/K''/p2m, or 
anti-TCR scFv (positive control) at 37 C, 5% CO2 m a 24 weU plate. After 20 hours in 
culture, cells were harvested, washed in PBS/0.5%BSA and stained with biotinylated 
10 clonotypic antibody, 1B2 [Kranz et al. (1984) supra] for 45 minutes. These yeast/T ceU 
mixtures were washed and stained with a mixture of the early activation marker FITC-labeled 
anti-CD69 antibody and SA:PE. The yeast/T cell mixture was analyzed for the CD69 marker 
(present only on activated T cells) by flow cytometry and gating on 1B2 positive T cells. 

15 The yeast display system was exploited to produce a random mutagenized library from 

which stabilized mutant Class I MHC sequences were isolated. Constructs encoding the 
fusion protems containmg SIYR/K^ and dEV8/K^ portions were mutagenized randomly using 
error prone PGR (0.16 Mn:Mg molar ratio). The homologous recombmation scheme 
illustrated in Fig. 9 was enqjloyed to create the libraries. From flie SIYR/K* experiment, 3,6 

20 million transformants were recovered. From flie dBV8/K* experiment, 2.7 million 
transformants were recovered. 5 plasmid mserts from each mutated library were sequenced 
(about 3200 bp of sequem^e per library) to determme tiie mutation frequency. In the SIYR/K^ 
experiment the mutation frequency was 0.37% (2 wild type sequences, 3, 4, and 5 mutant 
sequences). In the dEV8/K*' experiment, there was a 0.06 % mutation frequenqr (4 wild type, 

25 1 mutant sequence). 

Fig. 10 diagraromatically presents the generic soreening strategy for screwing the 
mutated yeast display libraries. Fig. 11 ilhistrates the results for second and durd sorts with 
anti-K!" monoclonal antibody B.8.i24.3. In tiie second sort the top 0.25% of the cells 
30 (according to fluorescence intensity, mfit, mean fluorescence units) were selected, and in the 
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third sort, the top 0. 1 % of the cells were selected. The profiles were similar for the dEV8/K^ 
experiment. 

Fig. 12 illustrates different K** binding as reflected in different levels of fluorescent 
peptide bound after sorting of the dEVS/K'* mutant library produced using error prone PGR. 
Fig. 13 presents selected sequences of mutant clones isolated by sorting of the dEV8/K?* mutant 
library, and the positions of flie mutations are shown on the ribbon diagram of the Class I 
MHC protein. The followmg provides further details on the construction of a dEV8/K^ mutant 
library. From the ribbon diagram it is deduced that Ae sequence EQYKFYSV (SEQ ID 
N0:5) is unportant. The E (PI, the first amino acid of the peptide sequence) is buried in the 
first pocket. Q, P2, is directed down into the pocket. Y, P3, is a-bulky, secondary anchor 
residue. K, P3, is a primary TCR contact, and it is directed out of the pocket. The F residue 
(P4) is an aromatic residue directed into the pocket; it serves as a primary MHC anchor. The 
Y residue at P6 is a large aromatic residue which also functions m TCR contact. The S residue 
(FT) is a small residue; the small size is dictated by the space available. The last V residue 
(PS) serves as a primary anchor residue, and it is directed into the pocket. To select mutations 
fliat stabilize dEVS bmdmg to K** , mutations are mtroduced m the library at positions that point 
mtoK^ A degenerate horary is produced atPl-P3(i.e.,NNNKFYSV, SEQ roNO:^^^ That 
library is constructed by error prone PCR and the mutant library is mtroduced into tiie wild 
type dEVS/K** plasmid by homologous transformation as shown in Fig. 9. Isolated 
recombinants are then sequenced. Of four sequenced, each contained different nucleotides 
encoding P1-P3. See Fig. 14 and its description herein above. 

The motivations for using directed evolution to isolate stabilized variants of L** include 
the lower stability of the wild type L*^ protein based on biochemical studies, the suboptimal 
loading of peptides m flie endoplasmic reticulum, the slower mtracellular transport of (4 hr 
V 1 hr), the lower cell surface expression (2-4 fold lower) and fewer a/p2m contacts. See 
Table 1. 

Fig. 15 provides data demonstrating that the prop^ly folded a3 domain but not the 
al/a2 domains of L*^ are displayed on the surface of recombinant yeast cells. We have also 
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demonstrated that and-L** al/2 domain (monoclonal antibody 30.5.7) did not bind to the same 
L^/p2m displayed on yeast cell surfaces. 

In summary, the Class I MHC was cloned as an AGA2 fusion in single chain format 
(AGA2-HA-K**-p2m-<:-myc). The full protein was detected on the yeast cell sur£ace using an 
antibody that recognizes the properly folded al/a2 donudns of K^. The coding sequences of 
this protein were also cloned with AGA2 at the carboxy terminus, with K!* binding peptides 
(SIYR, dEV8 and OVA) linked at the ammo termmi (peptide-HA-K*-p2m-c-myc-AGA2). 
Yeast that expressed the T cell specific peptide K** on the cell surface were capable of directly 
activating T cells, thereby providing a system for screening T cell agonists. 

! 

To create stabilized mutant fusion protein, two different mutational libraries were 
produced by error prone PGR to yield random mutations alter subsequent homologous 
recombination of mutagenized coding sequence and Imearized vector after co-electroporation. 
Mutants that displayed higher yeast sur&ce levels, and thus presumptive enhanced stability, 
were identified by flow cytometry sorting. 

The unstable class MHC protein was also cloned as an AGA2 fusion in susgle cham 
format (AGA2-HA- L*-p2m-c-myc). L** was detected on Ae yeast cell surfece with an a3- 
specific antibody, but properly folded al/cc2 domains were not detected with anti- L** antibody, 
suggesting that stabilized variants of can be engmeered by directed evolution and flow 
sorting for inqproved yeast surface expression. 

The present inventors have now succeeded in isolating stabilized mutants of the class 
I molecule L*^. The L*^ molecule has been shown to be relatively unstable. When expressed 
in the yeast display system, only the a3 domain appears to be folded properly on the sur£ace 
(Fig. 16). The displayed protein is not recognized by a conformation-specific a2 antibody (30- 
5-7) nor is the QL9/L* complex recognized by the high-afiBnity T cell receptor called m6 
(Fig.l6). To isolate stabilized L**, a library of random nmtants was e:q)ressed in yeast, and the 
library was selected wifli anti-L** antibody 30-5-7; see Fig. 17 and its description. Various 
mutants were isolated. One mutant (30.8) is shown m Fig. 17. Mutant30.8 bound to both the 
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a2-specific antibody and the QL9/L** (30.8) complex bound to the high affinity TCR m6, 
reflecting increased stability over its wild type parent protein. 

Another application of the yeast display to technology is to identify peptides that bind 
directly to MHC class I proteins. Serendipitously , we isolated a peptide that bound to the 
molecule even though the peptide was fused to flie C-termiims of the AGA2 protein. This 
finding was completely une>qpected. Previously, only peptides of 8 to 10 amino acids had been 
shown to bind to class I MHC products . This restriction in length was thought to be due to the 
need for both the N- and C-tenmni to bind within pockets of the class I molecule, thereby 
contributing binding energy to the interaction. AGA2-fusion peptides were isolated by 
screening with a dEVS/K*" tetramer. A carboxy terminal peptide sequence HYSPFRQLA (SEQ 
ID NO:37) that had K^-binding consensus anchor residues at positions P5 and P8 was isolated 
(Fig. 18). Yeast displaying the AGA2-fusion with the sequence HYSPFRQLA (SEQ ID 
NO:37) at the C-terminus bound to dEV8/K^ tetramers (Fig. 19). This binding was a 
consequence of the relative instabiUty of dEV8 binding to K^ This instability apparently 
allows the peptide dEV8 to dissociate, and the surface-bound AGA2-fusion peptide binds to 
the firee K\ Without wishing to be bound by any particular theory, we believe that the prolme 
at position P4 allows the peptide to exit flie K'^-site at the residue amino temSinal to flie P5 
anchor. This is supported by tiie finding that an alanine substitution mute^ 
has significantty reduced binding (Fig. 19). These findings show that it is possible to create 
a Ubrary of fused peptides (to AGA2) and that this library can be screened for K'^-binding (Fig. 
20). Thus, the yeast display system can be used to identify novel class I-bmding peptides. 

In order to provide a clear and consistent understanding of the specification and claims, 
including the scope to be given to such terms, the following definitions are provided. 

A coding sequence is tiie part of a gene or cDN A whidi codes for the amino acid 
sequence of a protein, or for a fiinctional RNA such as a tElNA or rRNA. 

Conq)lement or canqplementary sequence means a sequence of nucleotides which forms 
a hydrogen-bonded duplex wifli anoflier sequence of nucleotides according to Watson-Crick 
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base-pairing rules. For exanD5)le, the complementary base sequence for 5'-AAGGCT-3' is 3'- 
TTCCGA-5\ 

Downstieam means on tbe 3' side of any site in DNA or RNA. 

5 

Expression refers to tbe transcrq)tion of a grae into structural RNA (rRNA, tRNA) or 
messenger RNA (mRNA) and subsequent translation of a mRNA into a protein. 

An amino acid sequence fliat is functionaUy equivalent to a specifically exen^lified 
10 MHC protein sequence is an amino acid sequence that has been modified by single or multiple 
amino acid substitutions, by addition and/or deletion of amino acids, or where one or more 
amino acids have been chemically modified, but which nevertheless retains the binding 
specificity and high affinity bmding activity of a cell-bound or a soluble MHC protein of the 
present invention. Functionally equivalent nucleotide sequences are those that encode 
1 S polypeptides havmg substantially the same biological activity as a specifically exemplified cell- 
bound or sohible MHC protein. In the context of the present invention, a soluble MHC protein 
is lacks the portions of a native cell-bound MHC and is stable in solution (i.e., it does not 
generally aggregate m solution when handled as described herein and under standard conditions 
for protein solutions). 

20 

Two nucleic acid sequences are heterologous to one another if the sequences are derived 
from separate organisms, whether or not such organisnas are of different species, as long as 
the sequences do not naturally occur together in the same arrangement in the same organism. 

25 Homology refers to the extent of identity between two nucleotide or amino acid 

sequences. 

Isolated means altered by the hand of man fixnn the natural state. If an "isolated" 
conoposition or substance occurs in nature, it has been changed or removed from its original 
30 environment, or both. For example, a polynucleotide or a polypeptide naturally present in a 
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living animal is not isolated, but the same polynucleotide or polypeptide separated from tlie 
coexisting materials of its natural state is isolated, as the term is employed herein. 

A linker region is an amino acid sequence that operably links two functional or 
structural domains of a protein. 

A nucleic acid construct is a nucleic acid molecule which is isolated from a naturally 
occurring gene or which has been modified to contain segments of nucleic acid which are 
combined and juxtaposed in a manner which would not otherwise exist in nature. 

Nucleic acid molecule means a single- or double-stranded linear polynucleotide 
containing either deoxyribonucleotides or ribonucleotides that are linked by 3*-5'- 
phosphodiester bonds. 

Two DN A sequences are operably Imked if the nature of the Imkage does not interfere 
with the ability of the sequences to effect theu: normal functions relative to each other. For 
instance, a promoter region would be operably linked to a coding sequence if the promoter 
were capable of effecting transcription of that coding sequence. 

A polypeptide is a linear polymer of amino acids that are linked by peptide bonds. 

Promoter means a cis-acting DNA sequence, generally 80-120 base pairs long and 
located upstream of the initiation site of a gene, to which RNA polymerase binds and initiates 
correct transcription. There can be associated additional transcription regulatory sequences 
which provide on/off regulation of transcrq)tion and/or which enhance (increase) expression 
of the downstream coding sequence. 

A recombmant nucleic acid molecule, for mstance a recombinant DNA molecule, is a 
novel nucleic acid sequence formed m vitro through the ligation of two or more 
nonhomologous DNA molecules (for exan^^le a recombinant plasmid containing one or more 
inserts of foreign DNA cloned into at least one cloning site) , through PGR technology or by 
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directed homologous recombination, e.g. by co-transformation of two or more DNA molecules 
having at least regions of limited sequence identity to aUow for homologous recombination 
with the transformed cell. . 

Transformation means the directed modification of the genome of a cell by the external 
application of purified recombinant DNA from another cell of different genotype, leading to 
its uptake and integration into the subject cell's genome. In bacteria, the recombmant DNA 
is not typically integrated mto the bacterial chromosome, but instead replicates autonomously 
as a plasmid. 

Upstream means on the 5 ' side of any site in DNA or RN A; 

A vector is a nucleic acid molecule that is able to replicate autonomously ia a host cell 
and can accept foreign DNA. A vector carries at least one origin of replication functional in 
at least one type of cell, one or more unique recognition sites for restriction endonucleases 
which can be used for the insertion of foreign DNA, and usually selectable markers such as 
genes coding for antibiotic resistance, and often recognition sequences (e.g. promoter) for the 
expression of the inserted DNA. Common vectors include plasmid vectors and phage vectors. 
There can be more than one origin of replication to allow for replication and maintenance in 
more than one type of cell (e.g., separate origins of replication functional iu yeast and 
Escherichia coti^ respectively). 

A virus infected cell is a cell in which a virus is replicating. Typically , a virus infected 
cell displays at least one antigen on its surfece which is characteristic of the virus infection 
process. Such an antigen can be the target of recognition by the inomune system and 
subsequent killing of the cell. 

A pathogen infected cell is a human or animal cell in which an intracellular parasite 
(bacterial, fungal or protozoan) is surviving or rq)roduciqg. Typically a pathogm infected cell 
displays at least one antigen on its surface which is characteristic of the infection, and this 
antigen can be the target of immune recognition and targeting for killu^ of tbe infected cell . 
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The role of proteins encoded by the major histocompatibility complex (MHC proteins) 
have been known for nearly twenty years. MHC proteins are expressed by every individual, 
and they function as antigen-presenting molecules. Each MHC protein can bind to a variety 
of different small peptides (8 to 20 amino acids). 

Recently it was demonstrated that a scTCR (Vp-lihker-Va) could be displayed on the 
surface of yeast [Kieke et al. (1999) Proc. Natl Acad, ScL USA 96:5651-5656], in the yeast 
display system tiiat has proven successful in antibody engineering [Boder and Wittrup (1997) 
Nat. Biotech. 15: 553-^557.; Kieke et al. (1999) supra]. In addition, it was shown that mutations 
that increased the surface levels of the TCR also increased the sability of the TCR in solution 
[Shusta et al. (1999) J. Mol Biol 292:949-956] . Thus, yeast surface display can now be used 
to isolate proteins that exhibit greater stability. 

It will be appreciated by those of skill in the art that, due to the degeneracy of the 
genetic code, numerous functionally equivalent laicleotide sequences encode the same amino 
acid sequence of an improved Class I MHC protein or class I MHC/peptide complex. 

Additionally, those of skill in the art, through standard mutagenesis techniques, in 
conjunction with the antigen-finding activity assays described herein, can obtain altered MHC 
class I protein sequences and test them for the expression of polypeptides having particular 
bindiDg activity. Useful mutagenesis techniques known in the art include, without limitation, 
oligonucleotide-directed mutagenesis, region-specific mutagenesis, linker-scanning 
mutagenesis, and site-directed mutagenesis by PCR [see e.g. Sambrooketal, (1989) vide infra, 
and Au^ubel et al. (1999) vide infra] . 

In obtaining variant MHC coding sequences, those of ordinary skill in the art will 
recognize that MHC-derived proteins can be modified by certain amino acid substitutions, 
additions, deletions, and post-translational modifications, without loss or reduction of 
biological activity. In particular, it is well-known that conservative amino acid substitutions, 
that is, substitution of one amino acid for another amino acid of similar size, charge, polarity 
and conformation, are unlikely to significantly alter protein function. The 20 standard amino 
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acids that are the constituents of proteins can be broadly categorized into four groups of 
conservative amino acids as follows; the nonpolar (hydrophobic) group includes alanine, 
isoleucine, leucine, methionine, phenylalanine, proline, tryptophan and valine; the polar 
(uncharged, neutral) group includes asparagine, cysteine, glutamine, glycine, serine, threonine 
5 and tyrosme; the positively charged (basic) group contains arginine, histidine and lysine; and 
the negatively charged (acidic) groiqp contains aspartic acid and glutamic acid. Substitution m 
a protein of one an:iino acid for another within the same group is unlikely to have an adverse 
effect on the biological activity of the protein. 

10 Homology between nucleotide sequences can be determined by DNA hybridization 

analysis, wherein the stability of the double-stranded DNA hybrid is dependent on the extent 
of base pairing that occurs. Conditions of high temperature and/or low salt content reduce the 
stability of the hybrid, and can be varied to prevent annealing of sequences having less than 
a selected degree of homology. For instance, for sequences with about 55% G - C content, 

15 hybridization and wash conditions of 40 - 50 C, 6 X SSC (sodium chloride/sodium citrate 
buffer) and 0.1% SDS (sodium dodecyl sulfate) indicate about 60 - 70% homology, 
hybridization and wash conditions of 50 -65 EC, 1 X SSC and 0.1% SDS indicate about 82- 
97% homology, and hybridization and wash conditions of 52 C, 0.1 X SSC and 0.1 % SDS 
mdicate about 99 -100% homology. A wide range of computer programs for con^aring 

20 nucleotide and amino acid sequences (and measuring the degree of homology) are also 
available, and a list providing sources of both commercially available and free software is 
foxmdinAusubel^/fl/. (1999). Readily available sequence corcqparison and multiple sequence 
alignment algorithms are, respectively, the Basic Local Alignment Search Tool (BLAST) 
(Altschul et aL, 1997) and ClustalW programs. BLAST is available on the Internet at 

25 htlp://www.ncbi,nlm.nih.gov and a version of ChistalW is available at http://www2.ebi.ac,uk. 

huhistrial strains of microorganisms (e.g., Aspergillus lUger, Aspergillus ficuum, 
Aspergillus awamori^ Aspergillus oryzae^ Trichoderma reesei, Mucor ndehei^ Ktuyveromyces 
lactis, Pichia pastoris^ Saccharomyces cerevisiae, Escherichia coti^ Bacillus subtilis or Bacillus 
30 lichenifomds) or plant species (e.g., canola, soybean, com, potato, barley, rye, wheat) may 
be used as host cells for the recombinant production of the mutant MHC protems of the present 
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invention. As the first step in the heterologous expression of a high affinity MHC protein or 
soluble protein, an expression construct is assembled to include the MHC or soluble MHC 
coding sequence and control sequences such as promoters, enhancers and terminators. Other 
sequences such as signal sequences and selectable markers may also be included. To achieve 

S extracellular expression of a soluble MHC polypeptide, the expression construct may include 
a secretory signal sequence. The signal sequence is not included on the expression construct 
if cytoplasmic expression is desired. The promoter and signal sequence are functional in the 
host cell and provide for expression and secretion of the MHC or soluble MHC protein. 
Transcriptional terminators are included to ensure efficient transcription. Ancillary sequences 

1 0 enhancing expression or protein purification may also be included in the expression construct. 

Various promoters (transcriptional initiation regulatory region) may be used according 
to the invention. The selection of the appropriate promoter is dependent upon the proposed 
expression host. Promoters from heterologous sources may be used as long as they are 
IS functional in the chosen host. 

Promoter selection is also dependent upon the desked efficiency and level of peptide 
or protein production. Inducible promoters such tac are often enq)loyed in order to 
dramatically increase the level of protein expression E. coU. Overexpression of proteins may 
20 be harmful to the host cells. Consequently, host cell growth may be limited. The use of 
inducible promoter systems allows the host cells to be cultivated to acceptable densities prior 
to induction of gene expression, thereby facilitating higher product yields. 

Various signal sequences may be used according to the invention. A signal sequence 
25 which is homologdus to the MHC coding sequence may be used. Alternatively, a signal 
sequence which has been selected or designed for efficient secretion and processing in the 
expression host may also be used. For example, suitable signal sequence/host cell paurs 
include the J}. subtiUs sacB signal sequence for secretion in J?, subtitis, and the Saccharomyces 
cerevisiae a-mating factor or P. pastoris acid phosphatase phol signal sequences for P. 
30 pastoris secretion. The signal sequence may be joined direcfly through the sequence encoding 
the signal peptidase cleavage site to the protein coding sequence, or through a short nucleotide 
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bridge consisting of usuaUy fewer than ten codons. where die bridge ensures correct reading 
frame of the downstream TCR sequence. 



Elements for enhancing transcrq>tion and translaaonhavebeenidentified for eukaryotic 
protein expression systems. For exan5)le, positioning the cauKflower mosaic virus (CaMV) 
promoter 1000 bp on either side of a heterologous promoter may elevate transcriptional levels 
by 10- to 400-fold m plant cells. The expression construct should also include the appropriate 
translational initiation sequences. Modification of the e3q)ression construct to mclude aKozak 
consensus sequence for proper ttanslational initiation may increase the level of translation by 
10 fold. 

A selective marker is often employed, which may be part of the expression construct 
or separate from it (e.g., carried by the expression vector), so that the marker may mtegrate 
at a site different from the gene of mterest. Bxan5)les inchide markers that confer resistance 
to antibiotics (e.g., bla confers resistance to anq>icillin for E. cott host cells, nptn confers 
kanamydn resistance to a wide variety of prokaryotic and eukatyotic ceUs) or that permit the 
host to grow on mhiimal medwm (e.g. , HIS4 enables P. pastoris or His" S. cerevisiae to grow 
in the absence of histidine). The selectable marker has its own transcriptional and translational 
initiation and termmation regulatory regions to aUow for independent expression of the marker. 
If antibiotic resistance is employed as amarker, the concentration of the antibiotic for selection 
will vary depending upon the antibiotic, generaUy ranging from 10 to 600 ng of the 
antibiotic/mL of medium. 

The egression construct is assembled by employing known rec(»nbinant DNA 
techniques (Sambrook et aL. 1989; Ausubel et al., 1999). Restriction en^srme digestion and 
ligation are the basic steps employed to join two fragments of DNA. • -The ends of the DNA 
fragment may require modification prior to ligation, and this may be accomplished by filling 
in overhangs, deletmg terminal portions of the fragment(s) with nucleases (e.g. , Exom), site 
directed mutagenesis, or by adding new base pairs by PGR. Polylinkers and adaptors may be 
enq)loyed to fecilitate joining of selected firagments. The e3q)ression construct is typicaUy 
assembled in stages en^loymg rounds of restriction, ligation, and transformation of E. coU. 
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Homlogons recombination strategies can be used with co^lectroporation of linear DNAs into 
yeast (see Fig. 9). Numerous cloning vectors suitable for construction of the expression 
construct are known m the art (XZAP and pBLUESCRIPT SK-1, Stratagene, LaJoUa, CA; 
pET, Novagen Inc. , Madison, WI; cited in Ausubel et cd., 1999) and the particular choice is 
not critical to the invention. The selection of cloning vector will be influenced by the gene 
transfer system selected for introduction of the expression construct into the host cell. At the 
end of each stage, the resulting construct may be analyzed by restriction, DNA sequence, 
hybridization and PGR analyses. 

The expression construct may be transformed into the host as the cloning vector 
construct, either linear or circular, or may be removed from the cloning vector and used as is 
or introduced onto a delivery vector. The delivery vector facilitates the introduction and 
maintenance of the expression construct in the selected host cell type. The expression 
construct is introduced into the host cells by any of a number of known gene transfer systems 
(e.g., natural competence, chemically mediated transformation, protoplast transformation, 
electroporation, biolistic transformation, transfection, or conjugation) (Ausubel etaL, 1999; 
Sambrook et al, 1989). The gene transfer system selected depends upon the host cells and 
vector systems xised. 

For instance, the expression construct can be introduced mto S. cerevisiae cells by 
protoplast transformation or electroporation. Electroporation of S. cerevisiae is readily 
accomplished, and yields transformation efficiencies comparable to spheroplast transformation. 

Monoclonal or polyclonal antibodies, preferably monoclonal, specifically reacting with 
a protein of interest may be made by methods known in the art. See, e.g., Harlow and Lane 
(1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratories; Goding (1986) 
Monoclonal Antibodies: Principles and Practice, 2d ed., Academic Press, New York; and 
Ausubel et al. (1999) Current Protocols in Molecular Biology, John Wiley & Sons, Inc. , New 
York. 
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MHC/peptide complexes of improved stability in cell-bound or soluble form which are 
characteristic of a particular neoplastic condition (cancer, tumor, or the like) or a particular 
virus infected cell or pathogen infected cell are useful, for example, as agonists of the immune 
system so that the neoplastic ceU, virus infected cell or pathogen infected cell is more 

5 efficiently targeted for rmoval by T cells of the immune system. The inq>roved MHC/p 
complexes can be labeled by joining, either covalently or noiK:ovalently, a substance which 
provides a detectable signal. Suitable labels mchide but are not Umited to radionuclides, 
enzjones, substrates, cofactors, inhibitors, fluorescent agents, chemituminescent agents, 
magnetic particles and the like. Additionally the MHC protein of the present invention can be 

10 coupled to a ligand for a second binding molecules: for example, the MHC protein can be 
biotinylated. Detection of the MHC protein or complex can be effected by binding of a 
detectable streptavidin (a streptavidin to which a fluorescent, radioactive, chenuluminescent, 
or other detectable molecule is attached or to which an enzyme for which there is a 
chromophoric substrate available). United States patents describing the use of such labels 

15 and/or toxic compounds to be covalentiy bound to the scTCR protein include but are not 
limited to Nos. 3,817,837; 3,850,752; 3,927,193; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; 4,331,647; 4,348,376; 4,361,544; 4,468,457; 4,444,744; 4,640,561; 4,366,241; 
RE 35,500; 5,299,253; 5,101,827; 5.059,413. Labeled MHC proteins or complexes can be 
detected using a monitoring device or method appropriate to the label used. Fhiorescence 

20 microscopy or fluorescence activated cell sortmg can be used where the label is a fluorescent 
moiety, and where the label is a radionuclide, gamma counting, autoradiography or liquid 
scintillation counting, for exan^le, can be used widi the proviso that the method is appropriate 
to the sample being analyzed and the radionuclide used. In addition, there can be secondary 
detection molecules or particle enq)loyed where there is a detectable molecule or particle which 

25 recognized the portion of the MHC protem which is not part of the binding site for the cognate 
TCR or other ligand or other ligand in the absence of a component as noted herein. The art 
knows useful compounds for diagnostic imaging in situ; see, e.g., U.S. PalentNo. 5,101,827; 
5,059,413. Radionuclides useful for therapy and/or koaging in vivo include "*Indnmi, 
^Rubidium, '^Iodine, "^Iodine, ^^lodine, ^GaUium, technetium. Toxins include diphtheria 

30 toxin, ricin and castor bean toxin, among others, with the proviso that once the TCR-toxin 
complex is bound to the cell, the toxic moiety is internalized so that it can exert its cytotoxic 
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effect. Immunotoxin technology is well known to the art, and suitable toxic molecules include, 
without limitation, chemotherapeutic drugs such as vindesine, antifolates, e.g. methotrexate, 
cisplatin, mitomycin, anthrocyclines such as daunomycin, daunorubicin or adriamycin, and 
cytotoxic proteins such as ribosome inactivating proteins (e.g., diphtheria toxin, pokeweed 
5 antiviral protein, abrin, ricin, pseudomonas exotoxin A or their recombinant derivatives . See, 
generally, e.g., Olsnes and Pihl (1982) Pharmac, Ther. 25:355-381 and Monoclonal 
Antibodies for Cancer Detection and Therapy, Eds. Baldwin and Byers, pp. 159-179, 
Academic Press, 1985. 

10 Table, high affinity MHC proteins specific for a particular ligand, e.g., a particular 

peptide, protein or cell ^e, are useful in diagnosing animals, including humans, believed to 
be suffering from a disease associated with the particular pMHC. The MHC molecules of the 
present invention are usefid for detecting T cells that are specific for essentially any antigen^ 
including, but not limited to, those associated with a neoplastic condition, an abnormal protein, f 

15 or an infection or infestation with a bacterium, a fungus, a vkus, a protozoan, a yeast, a^ 
nematode or other parasite. Stable, high affinity MHC proteins specific for a particular ligand 
can also be used to induce the activity of T cells against antigens if desirable. For example, 
there are many peptides that have been associated with neoplastic cells, abnormal proteins, 
bacteria, fimgi, a viruses, and protozoans whereby said peptides bind to a Class I MHC 

20 protein. Stable, high affinity MHC proteins in complex with these peptides could serve as 
vaccines^ against the diseases by inducing T cell activity. 

The high affinity MHC compositions can be formulated by any of the means known in 
the art. They can be typically prepared as injectables, especially for intravenous, 
25 intraperitoneal or synovial adnunistration (with the route determined by the particular disease) 
or as formulations for intranasal or oral administration, either as liquid solutions or 
suspensions. Solid forms suitable for solution in, or suspension in, liquid prior to injection or 
other administration may also be prepared. The preparation may also, for example, be 
emulsified, or the protein(s)/peptide(s) encapsulated in liposomes. 

30 
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The active ingredients are often mixed with excipients or carriers which are 
pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients 
include but are not limited to water, saline, dextrose, glycerol, ethanol. or the like and 
combinations thereof. The concentration of the MHC protein in injectable, aerosol or nasal 
formulations is usually in the range of O.OS to S mg/ml. Similar dosages can be administered 
to other mucosal surfaces. 

In addition, if desired, vaccines may contain minor amounts of auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which enhance 
the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are 
not limited to: aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutanme (thr-MDP); 
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP); N- 
acetylmuramyl-L-alanyl-D-isoglutaminyl-L~alanine-2~(l * -2 '-dipalmitoyl-sn-glyce^o- 
3hydroxyphospho^yloxy)•^^hylamine (CGP 19835A, referred to as MTP-PE); and RIBI, which 
contains three components extracted from bacteria: monophosphoryl lipid A, trehalose 
dimycolate and cell wall skeleton (MPL 4- TDM + CWS) in a 2 % squalene/Tween 80 emulsion. 
Such additional formulations and modes of administration as are known in the art may also be 
used. 

The stable high affinity MHC proteins of the present invention and/or pMHC-binding 
fragments having primary structure similar (more than 90% identity) to the high affinity MHC 
proteins and which maintain flie high affinity for the cognate ligand may be formulated into 
vaccines as neutral or salt forms. Pharmaceutically acceptable salts include but are not limited 
to the acid addition salts (formed with free amino groups of the peptide) which are formed with 
inorganic acids, e.g., hydrochloric acid or phosphoric acids; and organic acids, e.g., acetic, 
oxalic, tartaric, or maleic acid. Salts formed with the free carboxyl groups may also be 
derived from inorganic bases, e.g., sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and organic bases, e.g., isopropylamine, trimethylamine, 2-ethylamino-ethanol, 
histidine, and procaine. Alternatively, these stable high affinity MHC protems can be used as 
antagonists of an interaction between endogenous MHC proteins of similar specificity and flie 
cognate TCIR cells. 
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High affinity MHC proteins or complexes for therapeutic use, e.g., those conjugated 
to cytotoxic compounds are administered in a manner compatible with the dosage formulation, 
and in such amoxmt and manner as are prophylacticaiiy and/or therapeutically effective, 
according to what is known to the art. The quantity to be administered, which is generally in 
5 die range of about 100 to 20,000 fig of protein per dose, more generally in the range of about 
1000 to 10,000 jLig of protein per dose. Similar conq)Ositions can be administered in similar 
ways using labeled high affinity MHC proteins for use in imaging, for exanq)le, to detect 
deleterious cytotoxic T cells that are involved in autoimmune attacks and containing the 
cognate pMHCs. Precise amounts of the active ingredient required to be administered may 
10 depend on the judgment of the physician or veterinarian and may be peculiar to each 
individual, but such a determmation is well within the skill of such a practitioner. 

The vaccine or other immunogenic composition may be given in a single dose; two dose 
schedule, for example two to eight weeks apart; or a multiple dose schedule. A multiple dose 

15 schedule is one in which a primary course of vaccination may iuclude 1 to 10 or more separate 
doses, followed by other doses administered at subsequent time intervals as required to 
maintaia and/or reinforce the immune response, e.g., at 1 to 4 months for a second dose, and 
if needed, a subsequent dose(s) after several months. Humans (or other animals) immunized 
with the retrovirus-like particles of the present invention are protected from infection by the 

20 cognate retrovirus. 

Standard techniques for cloning, DNA isolation, amplification and purification, for 
ensgrmatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and 
the like, and various separation techniques are those known and commonly employed by those 

25 skilled in the art. A number of standard techniques are described m Sambrook et al. (1989) 
Molecular Cloning, Second Edition. Cold Spring Harbor Laboratory, Plamview, New York; 
Maniatis et al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New 
York; Wu (ed ) (1993) Meth, EnzymoL 218, Part I; Wu (ed.) (1979) Meth En^mol 68; Wu 
et al. (eds.) (1983) Meth. EnzymoL 100 and 101; Grossman and Moldave (eds.) Meth, 

30 EnzymoL 65; Miller (ed.) (1972) E^eriments in Molecular Genetics, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York; Old and Primrose (1981) Principles of Gene 
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Manipulation, University of California Press, Berkeley; ScMeif and Wensink (1982) Practical 
Methods in Molecular Biology; Glover (ed.) (1985) DMA Cloning Vol. I and H, IRL Press, 
Oxford, UK; Hames and Higgins (eds.) (1985) Nucleic Acid Hybridization, IRL Press, Oxford, 
UK; and Settow and Hollaender (1979) Genetic Engineering: Principles and Methods, Vols. 
5 1-4, Plenum Press, New York. Abbreviations and nomenclature, where employed, are deemed 
standard in the field and commonly used in professional journals such as those cited herein. 

AU references cited in the present application are incorporated by reference herein to 
supplement the disclosure and experimental procedures provided in the present Specification 
10 to the extent that there is no inconsistency with the present disclosure. 

The following exanq>les are provided for illustrative purposes, and are not intended to 
limit the scope of the invention as claimed herein. Any variations in the exenq)lified articles 
and/or methods which occur to the skilled artisan are intended to Ml within the scope of the 
15 present invention. 

EXAMPLES 

Example 1 . Fusion of the MHC a chain to the p2m in the Yeast Display Vector pCT302 
20 The mouse MHC K^ a chain was fused to the mouse P2m [Mottez et al. (1995) /. 

Med. 181(2), 493-502]. The K** gene was PCR amplified usmg primers with Nhe I and 4^ U 
restriction sites. The S' primer contained a 30 bp linker upstream of the gene and the 3' 
primer contained a 45 bp connecting linker (underlined) downstream of the gene (5* CAA TGG 
CT A GCG GTG GAG TT A AGG GTG GAG C AG GTG GAG GTT GAG GAG GTG GAG 
25 GCC CAC ACT CGC TGA GGT ATT TCG T 3', SEQ ID N0:7; and 5' TGA ACC TCC 
GCC TCC TGA TCC ACC GCC ACC TGA ACC TAT TCC ACC CTC CCATCT CAG 
GGTGAGGGGCTCAGG3',SEQIDN0.8). TheP2mgene was PGR amplified with the 
5' prhner contaming the overlapping 45 bp linker (underlined) upstream and ibt 3* primer 
containmg a c-myc epitope tag and unique A7k> I site downstream of the P2m f5' GGT GGA 
30 ATA GGT TC A GGT GGC GGT GGA TCA GGA GGC GGA GGT TCA ATC CAG AAA 
ACC CCT CAA ATT CAA GTA T 3', SEQ ID N0:9, and 5 'GTT CCC TCG AGC TAT 
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TAG AAG TCT TCT TCA GAA ATA AGC TTT TOT TCC ATG TCT CGA TCC CAG 
TAG ACG GT 3', SEQ ID NO;10). Using PGR "sewing" of the overlapping linker [Davis 
et al. (1992) Biotedmology 9(2), 165-169], an amplified K?/p2in fusion was generated using both 
the K?* and p2m PGR products, primers 1 and 4 and TaqPlus Precision PGR System 
(Stratagene, La Jolla, CA). The L''/p2m MHG gene was fused and PGR anq)lified in an 
analogous manner. The K''/p2m and L''/p2mPGR products were digested mihNhe VXho I and 
ligated into die yeast surface display vector pGT302 containing a nine-residue q)itope tag (HA) 
and die AGA2 open reading firame downstream of the inducible GALl promoter ^oder and 
Wittrup (1997) supra]. 

Example 2. Peptide/MHC Construction. 

The AGA2 gene was cloned at the GOOH terminus of the peptide/K''/p2m gene in order 
to allow a free NH2 terminus of the peptide. First, the K''/p2m signal sequence was PGR 
amplified with EcdBI and Nhe I restriction sites upstream and downstream of the signal 
sequence (SS) (5* TCT CAA GAATTG TAG TTG ATA CAT TTT 3', SEQ ID NO:ll; and 
5' GTA TCT OCT AGC TGC TAA AAG TGA AGC 3'. SEQ ID NO:12). The SS PGR 
product was digested Nhe VEco RI and ligated into the iVfte VEco RI digested pCrr302 vector. 
The resulting plasmid contained die SS but not die AGA2 gene. Secondly, a SIYRYYGL 
(SEQ ID N0:1) (SmQ/K'/plm PGR product was graierated wifli a SIYR encoded primer 
(underlined) (5 ' ATA GTA GGT AGC TTG GAT AAA AGG TCT ATT TATAGATAT 
TAT GGT TTG GTT AAG GGT GGA GGA GGT GGA GGT 3' , SEQ ID NO:13; and 5' 
CAA TCC AGA TCT TTA GTA ATG GAA GTG TTG TTG AGA AAT AAG 3', SEQ ID 
N0:14). Nhe I and Nde I restriction sites are located m the upstream and downstream primers 
respectively. The SIYR/KVp2m PGR product was digested wifli Nhe I and Nde I and Ugated 
into the JVSte VNde I digested pGT302 SS vector (now called SS-SIYR K" pGT302). Finally, 
AGA2 was PGR amplffied (5* GGA TAT GAT ATG CAG GAA GTG AGA ACT ATA3', 
SEQ ID N0:15, and 5* ATT TGC AGA TCT CGA GTT ACT AAG GGT AGT GTG GAA 
GGT GGT A 3', SEQ ID NO:16), digested wiOiNde I andXfto I and Ugated into the iNSfc 
I/Zfiol digested SS-SIYR K* pCT302. The resulting construct contained tiie following order 
of genes in the pCT302 bacHione: SS-SIYR K!'/p2m-AGA2. 
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Example 3. OVA and dEVS Peptide Loading. 

The sense and anti-sense oligonucleotide sequences for both OVA (SHNFEKL, SEQ 
ID N0:2) and dEV8 (EQYKFYSV, SEQ ID NO:5) with a iVTi^ I and Aft E site upstream and 
downstream respectively were each phosphorylated wifli T4 DNA kinase (OVA 5' CT AGC 
TTG GAT AAA AGG AGC ATC ATC AAT TTT GAA AAG CTT C3', SEQ ID N0:17; and 
5'TT AAG AAG CTT TTC AAA ATT GAT GAT GCT CCT TTT ATC CAA G3', SEQ ID 
N0:18; dEV8 5' CT AGC TTG GAT AAA AGG GAA CAA TAC AAA TTC TAC TCA 
GTT C3' , SEQ ID N0:19; and 5'TT AAG AAC TGA GTA GAA TTT GTA TTG TTC CCT 
TTT ATC CAA G3', SEQ ID NO:20). The sense and anti-sense phosphorylated 
oligonucleotides were mixed (400 pmol), heated at lOO'C for 1 min and cooled slowly. 
Phosphorylated peptide cassettes were Ugated into Me I/4/?ndigestBdSIYR/K''/p2mpCT302. 

Ugation reactions were transformed into DHIOB electrocompetent E. coli ceUs and plated on 
LB/amp and incubated for 15 hrs at 37 C. Transformants were screened, and positive clones 
were sequenced. The resulting plasmids contained OVA and dEV8 tethered to K''/p2m. 

Example 4. Transformation into Yeast. 

The resultant MHC nucleotide constructs were transformed by the lithram acetate 
(LiAc) transformationmelhod [Geilzetal. (1995) Yeast 11. 355-360] into the S. ceremiae strain 
BJ5465 (a ura3-52 trpl leu2Al his3A200pep4::HIS2prbA1.6canl GALiYe&st Genetic Stock 
Center, Berkeley, CA) containing a chromosomaUy integrated AGAl -coding sequence 
expressed under the control of the GALl promoter (strain EBYIOO; Boder and Wittnip (1997) 
. supra). 

Exanq>le5. Induction and Detection of MHC on the Yeast Surface. 

Yeast cells transformed wiflipCT302/MHC plasmid constructs were grown overnight 
atSO Cwithdiakingin2mLselectiveglucosemediumSI>CAA(gI^^ 
nitrogenbase 0.67 wt %, cas?mino acids 0.5 wt %). After 18-24 hours, recombinant AGAl 
+ AGA2^MHC I expression was induced at 20»C with shaking in 5 mL selective galactose 
medmm (SG<:AA, where 2% galactose replaces the glucose in SI>CAA). Cultures were 
harvested after 24-48 hours (1-2 doublings) by centrifiigation, washed with PBS (10 mM 
NaP04, 150 mM NaCl, pH 7.3) containing 0.5% bovine serum albumin and incubated 45 
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minutes on ice with 25 nL of an anti-MHC antibody, anti-c-myc Mab 9E10 (1:100 dilution of 
raw ascites fluid; Berkeley Antibody Co. , Richmond, CA), or anti-HA Mab 12CA5 (10 /ig/ml. 
Bodiringer Mannheim, Indianapolis, IN). Cells were washed with PBS and incubated 30 
minutes on ice with either FTTC-labeled goat anti-mouse IgG (1:50; Kirkegaard and 
Peny Labs, Inc., Gaiflwrsburg, MD) or a streptavidin-pl^coerythrin (SA-PE) conjugate 
(1:200; PharMmgen, San Diego, CA). Labeled yeast cells were analyzed on a Coulter Epics 
XL flow cytometer. Data for 10,000-20,000 events were collected, and the population was 
gated accordmg to light scatter (size) to prevent analysis of cell clumps. 

Example 6. Random Mutagenesis of dEV8/K''/p2m and SIYR/ K''/p2m. 

dEV8/K''/p2m and SIYR/K''/p2m genes were randomly mutagenized using a PGR error 
prone technique. dEV8/K''/p2m and SIYR/KVp2m were PCR amplified using vector specific 
primers at least 50 bp upstream and downstream of each gene with PCR conditions that cause 
random mutations to be mserted by Taq polymerase (GBCO/BRL, Invitrogen,Carelsbad. 
CA). At a Mn:Mg ratio of about 0.16:1, the polymerase is more susceptible to inserting a 
random nucleotide during elongation. Using homologous recombination [Raymond et al. 
(1999) Biotechmques 26(1): 134-138. 140-141], the dEV8/KVp2m and SiyR/KVp2m pCT302 
vectors digested with JV?i« and Ndel were combined with flie error prone PCR products and 
electroporated (Bio-Rad Gene-Pulser H, 1.5V. 25mF, 0.2cm gene pulser cuvettes) into 40 M 
of electrocompetent S. cerevisiae cells (BJ5465, strain EBYIOO). The resulting 
transformations (separately for the two chimeric genes) are pooled, and a dilution is plated on 
SD-CAA plates. Plates are incubated at 30 C for 3 days, and the library size is tabulated. 

Example?. Cell Sorting. 

The yeast Ubraiy was grown m SD-CAA (2% dextrose, 0.67 % yeast nitrogenbase, 1 % 
Casanuno acids (Difco, DetroU, MD) at 30 C to an OD«o=4.0. To uiduce surface scTCR 
e!q)ression, yeast cells were pelleted by centrifugation, resuspended to an ODmo=' 10 in SG- 
CAA (2% galactose, 0.67% yeast nitrogen base, 1 % casamino adds), and incubated at 20 C 
for 48 hr. In general, 10' cells/tube were incubated on ice for 1 hr with 50 /d of 
biotinylated anti-K* antibody (B.8.24.3) dihited in phosphate bufBered saline (pH 7.4) 
supplemented with 0.5 mg/mlBSA(PBS-BSA). After incubation, cells were washed and 
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labeled for 30 min with SA:PE in PBS-BSA. Yeast cells were then washed and resuspended 
in PBS-BSA immediately prior to sorting. Cells exhibiting the highest fluorescMice were* 
isolated by FACS sorting with a Coulter 753 bench. After isolation, sorted ceUs were 
expanded in SD-CAA and induced in SG-CAA for subsequent rounds of selection. Three 
sequential sorts were performed for each mutant preparation witii increasingly dilute anti-Kf*. 
The percentages of total cells isolated from each sort were 1.0, 0.25 and 0.1%, respectively. 
Aliquots of the tiiird sorts were plated on SD-CAA to isolate uidividual clones, which were 
tiien analyzed by flow (^tometry usmg a Coulter Epics XL instrumrait. The clones with the 
highest fluorescence have then DN A rescued with flie Zymoprep Yeast Plasmid Mmiprep Kit 
(ZymogenResearch. Orange. CA). TheDNAisretransformedmtoDHlOBelectrocompetent 
E. cott, nuni prepped and submitted for sequencmg. The sequences are analyzed, and the 
mutations are located. 



Table 1. 
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Table 2. 

Various Single-Chain Class I MHC Constructions 
Ooned in pCT302 Yeast Display Ved»r 

Nucleic Acid Sequence 1 : SS-AGAl-K^/pam-c-myc (SEQ ID NO:22) 

ATGCAGTTACTTCGCTGTTTTTCAATATTTTCTGTTATTGCTTCAGTTTTAGC 

ACAGQAACTGACAACTATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGA 

C6CCGTACTCTTTGTCAACGACTACTATTTTGGCCAACG6GAAGGCAATGCAA 

GGAGTTTTTGAATATTACAAATCAGTAACGTTTGTCAGTAATT6CGGTTCTCA 

CCCCTC2^CAACTA6aUAGGCy^GCCCC^TA3ACACACAGTATGTTTTTAAGG 

ACAATAGCTCGACGATTGAAGGTAGATACCCATACGACGTTCCAGACTACGCT 

CTGCAGGCTAGTGGTGGTGGTGGTTCTGGTGGTGGTGGTTCTGGTGGTGGTGG 

TTCTGCTAGCGGTGGACTTAAGG6TGGACCAGGTGGAGGTTCAGGAGGTGGAG 

GCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTCGGG 

GAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCGCTT 

CGACAGCGACGCGGAGAATCCGA6ATATGAGCCGCGGGCGCGGTGGATGGA6C 

AGGAGG6GCCCGAGTATTGGGA6CGGGAGACACAGAAAGCCAAGGGCAATGAG 

CAGAGTTTCCOAGTGQACCTGAGGACCCTGCTCGGCTACTACl^CCAGAGCAA 

GGGCGQCTCTCACACTATTCAGGTGATCTCTGGCTGT6AAGTGGGGTCCGACG 

GGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTACATC 

GCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCTGAT 

CACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCaOGGCCTACC 

TGGAGGGCACGTGCGTGGA6TG6CTCCGCAGATACCTQAAGAACG6GAACOCG 

ACGCTGCTGC6CACAGATTCCCCAAAGGCCCATGT6ACCCATCACAGCAGACC 

TGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTGACA 

TCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAGCTT 

GTGGAGACCAGGCCTGCAGGGQATGGAACCTTCCAGAAGTGGGCATCTGTGGT 

G6TGCCTCTTGGGAAGGAGCAGTATTAa^Ca.TGCCATGTGTACCATCAGGGGC 

TGCCT6AGCCCCTCACCCTQAGATGGGAGGGTG6AATAGGTTCAGGTGGCGGT 

GQATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAA6TATACTC 

ACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAACAC 

AGTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAAAAAATT 

CCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTATAT 

CCTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCAGAG 

TTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAaACATG 

GAACAAAAGCTTATTTCJraAAaAAaACTTGTAATAaCTCQAG 
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Amino Acid Sequence 1: SS-AGA2-K'/p2m-c-OTyc (SEQ ID NO:23) 

MQLLRCFSIFSVIASVLAQELTTICEQIPSPTLESTPYSLSTTTILANGKAMQ 
GVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTIEGRYPYDVPDYA 
LQASGGGGSGGGGSGGGGSAS6GLKGGPGGGSGGGGPHSLRYFVTAVSRPGLG 
EPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWMEQBGPBYWERETQKAKQNE 
QSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVQSDCaiLLRGYQQYAYDGCDYI 
ALNEDLKTWTAZkDMAALITKHKWEQAGEAERLRAYLEGTCVEWLRRYLKNGllA 
TLLRTDSPKl^HVTHHSRPEDKVTLRCWALGFYPADITLTWQLNGEELIQDMEL 
VETRPAGDGTFQKWASVWPLGKEQYYTCHVYHQGLPEPLTLRWEGGIGSGGG 
GSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVTQFHPPHIEIQMLKNGKKI 
PKVEMSDMSFSKDWSFYILAHTEFTPTETDTYACRVKHDSMAEPKTVYWDRDM 

EQKLISEEDL* *LE 



Nucleic Acid Sequence 2: SS-SiyR/KVp2in-c-myc-AGA2 (SEQ ID NO:24) 

t 

GAATTCTACTTCa.TACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTa^ 

ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGATCTAT 

TTATAGATATTATGGTTTGCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 

GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCC6TGTCCCGGCCCGGCCTC 

6GGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 

CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 

AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 

GAGO^GAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGT6AAGTGGGGTCCG 

ACGGGCGACTCCTCCGCG6GTACCAGCAGTACGCCTACGACGGCTGCGATTAC 

ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 

GATCACCl^CACAAGTGGQAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 

ACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 

GCGACGCTGCTGCOCACAGATTCCCCaUVAGGCCCATGTGACCCa^TCACAGCAG 

ACCTGAAGATAAAGTCACCCTGA0GTGCTGGGCCCTGGGCTTCTACCCTGCT6 

ACATCa^CCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAG 

CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAA6TGGGCATCTGT 

GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 

GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTG6AATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGZ^AAACCCCTCAAATTCaiAGTATA 

CTCACQCCACCCACCGQA6AATGGGAAaCCGAAC»TACT6AACT0CTACGTAA 

CACAGTTCCACCCGCCTCaiCATTaAAATCCAAATGCTGAAGAACGGGAAAAAA 

ATTCCTAAAGTAGAGATGTCAQATATGTCCTTCAGCAAGGACTGGTCTTTCTA 
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TATCCTGGCTCACACTGAATTCACCCCGACTGAGACTGATACATACGCCTGCA 
GA6TTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGAC 
ATGGAAOyVAAGCTTATTTCTGAAGl^GACTTGCATATGCAGGAACTGACAAC 
TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTTGT 
CAAC6ACTACTATTTTGGCCAACGGGAAGGCAATGCAAGGAGTTTTTGAATAT 
TACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 
CAAAGGCAGCCCCATAAACACACAGTATGTTTTTAAGGACAATAGCTCGACGA 
TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAQTAACTCGAG 



Amino Acid Sequence 2: SS-SIYR/K!'/p2m-o-nrvc-AGA2 (SEQ ID NO:25) 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKRSIYRYYGLIiKGGPGGGSGGG 
GPHSLRYFVTAVSRP6LGEPR"XMEV6YVDDTEFVRPDSDAENPRYEPRARWME 
QEGPEYWERETQKAKOTEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAY 
LEGTCVEWLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
ITLTWQLNGEELIQDMELVETRPAGDGTFQBCWASWVPLGKEQYYTCHVYHQG 
LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 
QFHPPHIEIQMLKNGKKIPKVEMSDMSPSKDWSFYILAHTEFTPTETDTYACR 
VKHDSMAEPKTVXWDRnMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 
TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVPKDNSSTI 

EGRYPYDVPDYA 



Nucleic Acid Sequence 3: SS-<iEV8/K''/p2m-c-in)«:-AGA2 (SEQ ID NO:26) 

GAATTCTACTTCATACATTTTCAATTAAGAT^GCAGTTACTTCGCTGTTTTTCA 
ATATTTTCT6TTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGAGAACA 
ATACAAATTCTACTCa^QTTCTTAAGGaTGGACCAaGTQGAGGTTCa^GGAGaTO 
GAQGCCCACACTCaCTQAGQTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 
GGGGAGCCCCGOTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTC6ACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGQATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
QAGCAGAGTTTCCGAGT6GACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 
CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAA6TGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCAGCAGTAC6CCTACGACG6CTGCQATTAC 
ATCGCCCTOaACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 
GATCACCAAACACa^GTGGGAGCAGGCTGGTGAAGGAGAGAGACTCAGGGCCT 
ACCTGGAGG6CACGTGC0TGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 
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GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGG6CTTCTACCCTGCTQ 

ACATCACCCTQACCTGGCAGTTGAATGGGGAGGAGCT6ATCCAGGACATGGAG 

CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGT 

GQTQ6TGCCTCTTGGGAA6GAGCA6TATTACACATGCCATGTGTACCATCAGG 

GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATA 

CTCACGCCACCCACCGGAGAATGGGAAGCC6AACATACTGAACTGCTACGTAA 

CACAGTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAIU^ 

ATTCCTAAAGTAGAGATGTOlGATATGTCOTCAGaVAGaACTGGTCTTTCTA 

TATCCTGGCTC!ACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCA 

GAGTTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATC6AGAC 

ATGGAACAAAAGCTTATTTCTGAAGAAGACTTGCATATGCAGGAACTGAC2VAC 

TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTT6T. 

CAACGACTACTATTTTGGCOL^CGGGAAGGCAATGCAAGGAGTTTTTGAATAT 

TAC2^Ta^GTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 

CAAAQQCAGCCCCATAAACACaiCAGTATGTTTTTAAGGACAATAGCTCGACGA 

TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAGTAACTCGAG 



Amino Acid Sequence 3: SS-dEV8/K'/p2m-o-OTyc-AGA2 (SEQIDNO:27) 

ILLHTFSIKMQLLRCPSIPSVIASVLAASLDKREQYKFYSVLKGGPGGGSGGQ 
GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTBFVRFDSDAENPRYEPRARWME 
QEGPEYWERETQKAKGNEQSPRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEABRLRAY 
LEGTCVEWLRRYLKNGNATLIJlTDSPKAHVTHHSRPEDKVTLRCmLGFYPAD 
ITLTWQLN6EELIQDMELVBTRPAGDGTFQKWASVVVPL6KEQYYTCHVYHQG 
LPEPLTLRWBGOIGSGGQGSGQGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 
QPHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSFYILAHTEFTPTETDTYACR 
VKHDSMAEPKTVYWDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 
TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 

EGRYPYDVPDYA 



Nucleic Acid Sequence 4: SS^VA/K^/p2m-o-»xyc-AGA2 (SEQ ID NO: 28) 

GAATTCTACTTCATAOlTTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTa^ 
ATATTTTCTGTTATTGCOTCAGTTTTAGCa^GCTAGCra 

CATCAATTTTGAAAAGCTTCTTAAGGGTGGACCAGGTGGAGGTTCa^GGAGGTG 
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GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 

GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 

CTTCGACAGCGACGCGGAGAATCCGAGATATGA6CCGC6G6C6CG6TGGAT6G 

AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 

GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 

ACGGGC6ACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 

ATCGCCCTGAACGAAGACCTGAAAAC6TGGACGGCGGCGGACATGGCGGCGCT 

GATC^CCAAACACAAGTGGGAGGi^GGCTGGTGAAGCAGAGAGACTCAGGGCCT 

ACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 

GCGACGCTGCTGCGCACa^GATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 

ACATCACCCTGACCTGGCAGTTGAATG6GGAGGAGCTGATCCAGGACATG6AG 

CTTGTGGAGACCAGGCCTGq^GGGGATGGAACCTTCCAGAAGTGGGCATCTGT 

GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 

GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCATU^TTCAAGTATA 

CTCACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAA 

Ca^Ca^GTTCCIACCCGCCTCACaVTTGAAATCCAAATGCTGAAGAACGGGAAAAAA 

ATTCCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTA 

TATCCTGGCTCACACTOIUITTCACCCCCACTQAGACTGATACATACGCCTGCA 

GAGTTAAGCJATGACAGTATQGCCGAQCCCAAGACCGTCTACTGGGATCGAGAC 

ATGGAACAAAAGCTTATTTCT6AAGAAGACTTGCATATGCAGGAACTGACAAC 

TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTTGT 

CAAC6ACTACTATTTTGGCCAACGGGAAGGCAATGCAAGQAGTTTTTGAATAT 

TACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 

CAl^lGGCAGCCCCATAAACACACAGTATGTTTTTAAGGACAATAGCTCGACGA 

TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAGTAACTCQAG 



Amino Add Sequence 4: SS-OVA/KVp2m-c-»iyc-AGA2 (SEQ ID NO:29) 

ILLHTFSIKMQLLRCPSIFSVIASVLAASLDKRSIINPEKLLKGGPQGGSGGG 
GPHSLRYPVTAVSRPGLGBPRYMBVGYVDDTEPVRPDSDABNPRYEPRARWMB 
QEGPEYWERETQKAKGNBQSPRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAY 
LEGTCVEWLRRYIJCNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASWVPLGKEQYYTCaHVYHQG 
LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNIIJTCYVT 
QPHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSFYILAHTEPTPBTDTYACRV 
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KHDSMAEPKTVYWDRDMEQKLI SEEDLHMQELTTICEQIP SPTLESTPYSLST 
TTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTIE 
QRYPYDVPDYA 



Nucleic Acid Sequence 5: SS-AGA2-LVp2m-c-j?iyc (SEQ ID NO:30) 

ATGCa^GTTACTTCGCTGTTTTTCaUiiTATTTTCTGTTATTGCTTCAGTTTTAGC 

ACA6GAACTGACAACTATATGC6AGCAAATCCCCTCACCAACTTTAGAATCGA 

CGCCGTACTCTTTGTCa^CGACTACTATTTTGGCCAACGGGAAGGCAATGCAA 

GGAGTTTTTGAATATTACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCA 

CCCCTCAACy^CTAGCa^GGCAGdCCCyiTAAACyiCacaLGTATGTTTTTAAGG 

ACAATAGCTCGACGATTGAAGGTAGATACCCATACGACGTTCCAGACTACGCT 

CTGCAGGCTAGTGGTGGTQGTGGTTCGGTGGTGGTGGTTCTGGTGOTGQTGOT 

TCTGCTAGCGGTGGACTTAAGGGTGGACCAGGTGGAOGTTCAGGAGGTGGAGG 

CCCa^CACTCGATGCGGTATTTCGAGACCGCGGTGTCCCGGCGCGGCCTCGGGG 

AGCCCCGGTACATCTCTGTCGGCTATGTGl^CGACAAGGAGTTCGTGCGCTTC 

GACAGCGACGCGGAGAATCCGAGATATGAGCCGAGGGCGCCGTGGATGGAGCA 

GGAGGGGCCGGAGTATTGGGAGCGGATCACGCAGATCGCCAAGGGCCAGGAGC 

AGTGGTTCCGAGTGAACCTGAGGACCCTGCTCGGCTACTACAACCAGAGCGCG 

GGCGGCACTCACACACTCCAGTGGATGTACGGCTGTGACGTGGGGTCGGACGG 

GCGCCTCCTCCGCGGGTACGAGCAGTTCGCCTACGACGGCTGCGATTACATCG 

CCCTGAACGAAGACCTQiAAAACGTGGACGTTCGCGGACATGTCGTCGATGATC 

ACCCGACGCAAGTGGGAGCAGGCTGGTGCTGCAGAGTATTACAG6GCCTACCT 

GGAGGGCGAGTGCGTGGAGTGGCTCCACAGATACCTGAAGAACGGGAATGCTA 

CGCTGCTGCGCACAGATTCCCCAAAGGCACATGT6ACCTATCACCCCAGATCT 

AAAGGTGAAGTCACCCTGAGQTGCTGGGCCCTGGGCTTCTACCCTGCTGACAT 

CACCCTGACCTGGCAGTTQAATGGGGAGQAGCTGACCCAGGACATGGAGCTTG 

TGGAGACCAGGCCTGCAG6GGATGGAACCTTCCAGAAGTGGGCATCTGTGGTG 

GTGCCTCTTGGGAAGGAGCAGAATTACACAT6CCGTGTGTACCATGAGGGGCT 

GCCCCATCCCCTCACCCTGAGATGGGA6GGTGGAATAGGTTCAGGTGGCGGTG 

GATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCJ^TTCAAGTATACTCA 

CGCCACCCACCGGAGAATGGGAAGCC6AACATACTGAACTGCTACGTAACACA 

GTTCCaCCCGCCTCAOlTTGAAATCCaAATGCTGAAGAACGGGAAAAAAATTC 

CTAAAGTA6AGAT6TCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTATATC 

CTGGCTCa^a^CTGAATTa^CCCCCACTGAGACTGATACATACGCCTGCAGAGT 

TAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACT6GGATC6AGACATGG 

AACA2UkAGCTTATTTCTGAAGAAGACTTGTAATAGCTCGAG 
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Axnino Acid Sequence 5: SS-AGA2-L''/p2m-c-/nyc (SEQ IDN0:31) 

MQLLRCFSIFSVIASVLAQELTTICEQIPSPTLESTPYSLSTTTILJ^NGKAMQ 
GVFEYYKSVTFVSNCGSHPSTTSKGSPINTQyVFKDNSSTIEGRYPyDVPDYA 
S LQASGGGGSGGGGSGGGGSASGGLKGGPG6GSGGGGPHSMRYFETAVSRRGLG 
EPRYISVGYVNDKEFVRFDSDAENPRYEPRAPP9MEQEGPEYWERITQIAKGQE 
QWFRVNLRTLLGYYNQSAGGTHTLQTOfYGCDVGSDGRLLRGYEQFAYDGCDYI 
ALNEDLKTWTFADMSSMITRRKWEQAGAAEYYRAYLEGECVEWLHRYLKNGMA 
TLLRTDSPIU^HVTYHPRSKGEVTLRCWALGFYPADITLTWQLNQBELTQDMEL 
10 VETRPAGDGTFQKWASWVPLGKEQNYTCRVYHEGLPHPLTLRWEGGIGSGGG 
GSGG6GSIQKTPQIQVYSRHPPENGKPNILNCYVTQFHPPHIEIQMLKNGKKI 
PKVEMSDMSPSKDWSFYILAHTEFTPTETDTYACRVKHDSMAEPKTVYWDRDM 
EQKLISEEDL 

15 

Table 3. 

Sequences of mutant clones isolated by sorting firom 
dEV8/Eb error-prone PCR library. 

20 Nucleic Acid Sequence 1: dEP.l (SEQ ID NO:32) 

GAATTCTACTTCa^TACa^TTTTCAATTAAGATGCAGTTACTTCGCTGTTl^ 

ATATTTTCTGTTATT6CTTCAGTTTTAGCUVGCTAGCTTGGATAAAAGAGAACA 

ATACAAATTCTACTCAGTTCTTAA6GGTGGACCAGGTGGAGGTTCAGGAGGTG 

25 GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 
GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTCGACAGC6ACGCGGAGAATCCGAGATATGAGCCGCGGGCGCG6TGGATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
GAGCAGAGTTTCCGA6TGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

30 OU^GGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTQAAGTGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACQACGGCT6CGATTAC 
ATCGCCCTGAACGAAGACCT6AAAACGTGGACGGCGGCGGACATG6CGGCGCT 
QATCACCAAACACAAGTGGGAGCAGGCTGGTGl^GCAGAGAGACTCAGGGCCT 
ACCTGGAGG6CACGTGCGTGGAGA66CTCCGCAGATACCTGAAGAACGGGAAC 

35 GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 
ACCTGAAGATAAA6TCACCCTGAG6T6CTGGGCCCTG6GCTTCTACCCTGCTG 
ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCT 



40 
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Amino Acid Sequence 1: dEP.l (SEQ ID NO:33) 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKREQYKFYSVLK6GP6G6S666 
GPHSLRYFVTAVSRPGLGEPRyMEVGyVDDTEFVRFDSDAENPRYEPRARWME 
QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAY 
LEGTCV^RLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
ITLTWQIiNQEELIQDMELYETRPAGDGTFQKWASVWPLQKEQYYTCHVYHQG 
LPEPLTLRWEGGI6S6GG6SGGG6SIQKTPQIQVYSRHPPENGKPNILNCYVT 
QFHPPHIEIQMLKN6KKIPK\7EMSDMSFSKDWSFNIIAHTEFTPTETDTYACR 
VKHDSMAEPKTVYWDRDMEQRLISEEDLHMQELTTICEQIPSPTLESTPYSLS 
TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 2: dEP.3 (SEQ ID NO:34) 

GAATTCTACTTCATACATTTTCAATTAAGATGCA6TTACTTCGCTGTTTTTCA 
ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAQCTTGQATAA3AGAQAACA 
ATACAAATTCTACTCAQTTCTTAAGGGTGGACCaiGGTGQAaGTTCAQGAGGTG 
QAGQCCCACACTCGCTGAGGTATTTCGTCACCGCCGTQTCCCGGCCCGGCCTC 
GGQQAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTCGACAGCGACGCGGAGAATCCGAGATAT6AGCCGCGGGCGCGGTGGATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCJ^GGGCAAT 
GAGCAGAGTTTCCQAGTGGACCTGAGGACCCTGCTCG6CTACTACAACCA6AG 
CAAGGGCGGCTCTCACa.CTATTCAGGTGATCTCTGGCTGT62U^GTGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCA6CAGTACGCCTACGACGGCTGCQATTAC 
ATCGCCCTGI^CaAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 
GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAQAGAGACTCAGGGCCT 
ACCTGGAGGGCACGTGCGTGGAGCG6CTCCGCAGATACCTGAAGAACGGGAAC 
GCGACGCTGCTGCGCACAGATTCCCCAAAGG 



Amino Acid Sequence 2: dEP.3 (SEQ ID NO:35) 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKREQYKFYSVLKGGPGGGSGGG 
GPHSLRYFVTAVSRP6LGBPRYMEVGYVDDTBPVRFDSDAENPRYEPRARWMB 
QEGPEYWERBTQKAKGNEQSPRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLXJ^GYQQYAYDGCDYIALNEDIJCTWTAADMAJ^ITKHKWEQAGEAERLRAY 
LEGTCVERLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKinJLRCWALGPYPAD 
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ITLTWQLNGEELIQDMELVETRPAGD6TFQKWASVWPLGKEQYYTCHVYHQG 
LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 
QPHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSPYILAHTEFTPTBTDTYACR 
VKHDSMAEPKTVYWDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 
5 TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 3: dEP.4 (SEQ ID NO:36) 

10 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 
ATATTTTCTGTTATTGCTTCAGTTTTA6CAGCTAGCTTGGATAAAAGAGAACA 
ATACAAATTCTACTCAGTTCTTAA66GT66ACCAGGTGGAGGTTCA6GAGGTG 
GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 

IS G6G6A6CCCCGGTACATGGAAGTCG6CTACGTGGACGACACGGAGTTCGTGC6 
CTTCAACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTG6ATGG 
AGCAGGAGGGGCCCGAGTATT6GGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 
CAAGGGCG6CTCTCACACTATTCAG6TGATCTCTGGCTGTGAAGTGGGGTCCG 

20 ACGGGCGACTCCTCC6C6GGTACCAGCAGTACGCCTACGACG6CTGCGATTAC 
ATCGCCCTGAACGAAGACCTGZ^AAACGTGGACGGCGGCGGACATGGCGGCGCT 
GATCACCAAACACAA6TGGGAGCAGGCTGGTGAAGCAGAGA6ACTCAG6GCCT 
ACCTGGAGGGCACGTGCGTGGAGAGGCTCCGCAGATACCTGAAGAACGGGAAC 
GCGACGCT6CTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

25 ACCTGAAGATAAAGTCACCCTGA6GTGCTGGGCCCTGGGCTTCTACCCTGCTG 
ACATCACCCTGACCT6GCAGTTGAATG6GGAGGAGCTGATCCA6GACATGGAG 
CTTGTGGA6ACCAG6CCTGCAG6GGATGGAACCTTCCA6AAGT6GGCATCT6T 
GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 
GGCT6CCTGAGCCCCTCACCCTGAGATG6GA6GGT6GAATA6GTTCA66T66C 

30 GGTGGATCAGGAGGCGGAGGTTCAATCCAG2U^CCCCTC2U^TTCAAGTATA 
CTCACGCCACCCACC6GAGAATGGGAAGCCG 



Amino Acid Sequence 3: dEP.4 (SEQ E) NO:37) 

35 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKREQYKFYSVLKGGP6GGSGGG 
GPHSLRYFVTAVSRPGL6EPRYMEVGYVDDTEFVRFNSDAENPRYEPRARVVME 
QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYD6CDYIALNEDIiKTNTAADMA2U:.ITKHKWEQAGEAERIJUl^ 
40 LEGTCVERLRRYLIQVGMATLIJlTDSPKAHVTHHSRPEDKVTLRCmLGFYPAD 
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ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASVWPLGKEQYYTCHVyHQG 
LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 
QFHPPHIEIQMLKNQKKIPKVEMSDMSFSKDWSFNILAHTEFTPTETDTYACR 
VKHDSMAEPKTVYWDRDMEQRLZSEEDLHMQELTTICEQIPSPTLESTPYSLS 
5 TTTIIANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPIHTQYVFKDNSSTI 
EGRYPYDVPDYA 
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WHAT IS CLAIMED IS: 

1. A mutagenized combinatorial library of Major Histocompatibility Complex (MHC) 
Class I chimeric proteins displayed on the surfaces of recombinant yeast cells, wherein 
the mutagenized combinatorial library comprises at least one member MHC Class I 
chimeric protein which is improved in conformational stability or in specific target 
binding as compared with a comparison MHC Class I chimeric protein which has not 
been mutagenized. 

2. The mutagenized combinatorial library of claim 1 wherein die MHC Class I chimeric 
protein comprises a portion mediating binding to the surfaces of the recombinant yeast 
cells and a portion which comprises a specific target binding region of a MHC Class 
I protem. 

3 . The mutagenized combinatorial library of claim 2 wherein the portion mediated binding 
to the surfaces of the recond)]nant yeast cells is a matmg adhesion recq)tor portion. 

4. The mutagenized combinatorial library of claim 3 wherein the matmg adhesion recq)tor 
portion is an AGA2 portion. 

5. The mutagenized combinatorial library of any of claims 2 to 4 wherein the chimeric 
protein further comprises a portion characterized by an amino acid sequence of a 
peptide which binds to the bmding region of the MHC Class I chimeric protein. 

6. The mutagenized combinatorial library of any of claims 2 to 5 wherein the chimeric 
protein further comprises a portion derived from a c-myc protein and which mediates 
bmding to a c-myc specific antibody. 

7. The mutagenized combinatorial library of any of claims 2 to 6 wherein the binding 
region of the MHC Class I chimeric protein specifically binds a specific target selected 
from the group consisting of a neoplastic cell, a virus-infected cell, a fimgus-infected 
cell, a parasite-infected cell and a bacterium-infected cell. 



46 



wo 02/46399 



PCT/USOl/47817 



8. The mutagenized combinatorial library of claim 8 wherein the peptide binding region 
specifically binds a peptide having the amino acid sequence given in SEQ ID NO: 19, 
SEQ ID NO:22 or SEQ ID NO:24. 

9. The mutagenized combinatorial library of claim 8 wherem said chimeric protein 
comprises an amino acid sequence as given in SEQ ID NO: 17. 

10. An isolated mutant MHC Class I chimeric protein, wherein said protein conc^rises a 
portion mediating binding to the surfaces of the recombinant yeast cells and a portion 
which comprises a peptide binding region of a MHC Qass I protein and wherein said 
chimeric protein is improved in stability as compared with an MHC Class I chimeric 
protein which is not a mutant chimeric protein. 

11 . The isolated mutant MHC Class I chimeric protein of claim 10 wherein the chimeric 
protein further comprises a portion comprising an amino acid sequence of a peptide 
which binds to the peptide binding region of the MHC Class I protein. 

12. The isolated mutant MHC Class I chimeric protein of claim 10 wherein a peptide 
which binds to the peptide binding region of the MHC Class 1 protein is associated with 
a neoplastic or infectious disease. 

13. The isolated mutant MHC Class I chimeric protein of claim 10 wherein the pqptide 
bmding region specifically binds a peptide having the amino acid sequence given in 
SEQ ID NO: 19, SEQ ID NO:22 or SEQ ID NO:24. 

14. The isolated mutant MHC Class I chimeric protein of claim 1 1 wherein said diimeric 
protein further comprises a detectable label. 

15 . The isolated mutant MHC Class I chimeric protein of claim 15 wherein the detectable 
label is a fluorescent moiety, a chromophore, a radionuclide, a chemiluminescent agent, 
a magnetic particle, an enzyme, a cofactor, a substrate or a toxin. 

16. A method for detection of a lymphocyte having a T cell receptor protein in a biological 
sample, said method comprising the steps of contacting the sample with an isolated 
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mutant chimeric protein of claim 14, wherein said chimeric protein is complexed to the 
peptide or wherein the chimeric protein and peptide are covalently bound, wherein said 
chimeric protein comprises a binding region which specifically binds said T cell receptor 
protein under conditions which allow the binding of the T cell receptor protein to the 
chimeric protein, and detecting the chimeric protein bound to the T cell receptor protein. 

17. The method of claim 16 wherein the T lymphocyte is specific for a neoplastic cell, a 
tumor cell, a vims-mfected cell, a protozoan-infected cell, a bacterium-infected cell or 
a fungus-infected cell. 

18. The method of claim 16 or 17 whereia the biological sample is cells, a tissue sample, 
biopsy material or bodily fluids. 

19. A method for activating or enhancing an immune response to an abnormal cell selected 
from the group consisting of a neoplastic cell, a tumor cell, a virus-infected cell, a 
parasite-infected cell, a fungus infected cell or a protozoan infected cell in a human or 
animal, said method comprising the stqp of administering to the patient a therapeutically 
effective amount of aia isolated mutant MHC Class I chimeric protein or a mutant MHC 
Class I chim^c protem/peptide complex which is improved in conformational stability 
or in^proved in binding to T lymphocyte as compared with the MHC Class I chuneric 
protein which is not mutant, whereby the immune response in the human or animal is 
activated or enhanced. 



20. The method of claim 19 wherein the administering is by intravenous, intramuscular, 
intradermal, subcutaneous or intraperitoneal administration. 

21 . The method of claim 20 wherein said isolated mutant protein has a portion conqprismg 
an amino acid sequence as given in SEQ ID N0:17. 

22. The method of claim 21 wherein said mutant protein binds a peptide comprising an 
amino acid sequence as given in SEQ ID NO: 19, SEQ ID NO:22 or SEQ ID NO:24. 
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23. A combinatorial library of peptides anchored to the surface of yeast cells, each cell 
displaying a peptide of a unique sequence anchored to its surface, herein the peptide is 
anchored to the sxuface of the yeast ceil by a mating &ctor sequence. 

24. The combinatorial Ubrary of claim 23 wherein the mating factor sequence is an AGA2 
sequence. 

25. A method for identifying a peptide which binds specifically to an MHC protein 
comprising the step of contacting a detectable MHC protein wth the combinatorial 
Ubrary of clahn 24 or 25 under conditions which allow binding of the protein and the 
peptide, and detecting peptide bound to the MHC protein. 
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SEQUENCB LISTING 



<110> Board of Trustees of the University of Illinois 

<120> Mutated Class I Major Histocompatibility proteins and 
Complexes 

<130> 100-OOWO 

<140> Not assigned 
<141> 2001-12-10 ' 

<150> 60/254,495 
<151> 2000-12-08 

<160> 37 

<170> Patent In Ver. 2.0 

<210> 1 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 1 

Ser lie Tyr Arg Tyr Tyr Gly Leu 
1 5 

<210> 2 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 2 

Ser lie lie Asn Phe Glu Lys Leu 
1 5 



<210> 3 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



1 



wo 02/46399 



PCTAJSOl/47817 



<220> 

<221> Tnisc_feature 

<222> (1) . . (63) 

<223> N is A, T, G or C. 

<400> 3 

atattttctg ttattgcttc agttttagca gctagcttgg ataaaagaxm snnsnnsaaa 60 
ttc 63 

<210> 4 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 4 

Gin Leu Ser Pro Phe Pro Phe Asp Leu 
1 5 



<210> 5 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 5 

Glu Gin Tyr Lys Phe Tyr Ser Val 
1 5 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

\ 

<400> 6 

Asn Asn Asn Lys Phe Tyr Ser Val 
1 5 



<210> 7 
<211> 79 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



<400> 7 

caatggctag cggtggactt aagggtggac caggtggagg ttcaggaggt ggaggcccac 60 
actcgctgag gtatttcgt 79 

<210> 8 
<211> 72 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 8 

tgaacctccg cctcctgatc caccgccacc tgaacctatt ccaccctccc atctcagggt 60 
gaggggctca gg 72 

<210> 9 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



<400> 9 

ggtggaatag gttcaggtgg cggtggatca ggaggcggag gttcaatcca gaaaacccct 60 
caaattcaag tat 73 

<210> 10 
<211> 71 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



<400> 10 

gttccctcga gctattacaa gtcttcttca gaaataagct tttgttccat gtctcgatcc 60 . 
cagtagacgg t 71 



<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 11 

tctcaagaat tctacttcat acatttt 27 
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<210> 12 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 12 

gtatctgcta gctgctaaaa ctgaagc 27 

<210> 13 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of /Artificial Sequence: Synthetic 
nucleotide primer 

<400> 13 

atactagcta gcttggataa aaggtctatt tatagatatt atggtttgct taagggtgga 60 
ccaggtggag gt 72 

<210> 14 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 14 

caatccagat ctttactaat gcaagtcttc ttcagaaata ag 42 

<210> 15 
<211> 30 

<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



<400> 15 

ggatatcata tgcaggaact gacaactata 

<210> 16 
<211> 43 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 
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<400> 16 

atttgcagat ctcgagttac taagcgtagt ctggaacgtc gta 

<210> 17 
<211> 42 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 17 

ctagcttgga taaaaggagc atcatcaatt ttgaaaagct tc 

<210> 18 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 18 

ttaagaagct tttcaaaatt gatgatgctc cttttatcca ag 

<210> 19 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 19 

ctagcttgga taaaagggaa caatacaaat tctactcagt tc 

<210> 20 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 20 

ttaagaactg agtagaattt gtattgttcc cttttatcca ag 

<210> 21 
<211> 1632 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 21 

atgcagttac ttcgctgttt ttcaatattt tctgttattg cttcagtttt agcacaggaa 60 
ctgacaacta tatgcgagca aatcccctca ccaactttag aatcgacgcc gtactctttg 120 
tcaacgacta ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa 180 
tcagtaacgt ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc 240 
ataaacacac agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac 300 
gacgttccag actacgctct gcaggctagt ggtggtggtg gttctggtgg tggtggttct 360 
ggtggtggtg gttctgctag cggtggactt aagggtggac caggtggagg ttcaggaggt 420 
ggaggcccac actcgctgag gtatttcgtc accgccgtgt cccggcccgg cctcggggag 480 
ccccggtaca tggaagtcgg ctacgtggac gacacggagt tcgtgcgctt cgacagcgac 540 
gcggagaatc cgagatatga gccgcgggcg cggtggatgg agcaggaggg gcccgagtat 600 
tgggagcggg agacacagaa agccaagggc aatgagcaga gtttccgagt ggacctgagg 660 
accctgctcg gctactacaa ccagagcaag ggcggctctc acactattca ggtgatctct 720 
ggctgtgaag tggggtccga cgggcgactc ctccgcgggt accagcagta cgcctacgac 780 
ggctgcgatt acatcgccct gaacgaagac ctgaaaacgt ggacggcggc ggacatggcg 840 
gcgctgatca ccaaacacaa gtgggagcag gctggtgaag cagagagact cagggcctac 900 
ctggagggca cgtgcgtgga gtggctccgc agatacctga agaacgggaa cgcgacgctg 960 
ctgcgcacag attccccaaa ggcccatgtg acccatcaca gcagacctga agataaagtc 1020 
accctgaggt gctgggccct gggcttctac cctgctgaca tcaccctgac ctggcagttg 1080 
aatggggagg agctgatcca ggacatggag cttgtggaga ccaggcctgc aggggatgga 1140 
accttccaga agtgggcatc tgtggtggtg cctcttggga aggagcagta ttacacatgc 1200 
catgtgtacc atcaggggct gcctgagccc ctcaccctga gatgggaggg tggaataggt 1260 
tcaggtggcg gtggatcagg aggcggaggt tcaatccaga aaacccctca aattcaagta 1320 
tactcacgcc acccaccgga gaatgggaag ccgaacatac tgaactgcta cgtaacacag 1380 
ttccacccgc ctcacattga aatccaaatg ctgaagaacg ggaaaaaaat tcctaaagta 1440 
gagatgtcag atatgtcctt cagcaaggac tggtctttct atatcctggc tcacactgaa 1500 
ttcaccccca ctgagactga tacatacgcc tgcagagtta agcatgacag tatggccgag 1560 
cccaagaccg tctactggga tcgagacatg gaacaaaagc ttatttctga agaagacttg 1620 
taatagctcg ag 1632 

<210> 22 
<211> 540 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 22 

Met Gin Leu Leu Arg Cys Phe Ser He Phe Ser Val He Ala Ser Val 
15 10 15 

Leu Ala Gin Glu Leu Thr Thr He Cys Glu Gin He Pro Ser Pro Thr 
20 25 30 '* 

Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr He Leu Ala Aen 
35 40 45 

Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys Ser Val Thr Phe 
50 55 60 

Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys Gly Ser Pro 
65 70 75 80 
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He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr He Glu Gly 
85 90 95 

T^g Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Leu Gin Ala Ser Gly Gly 
100 105 110 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ser Gly 
115 120 125 

Gly Leu Lys Gly Gly Pro Gly Gly Gly Ser Gly Gly Gly Gly Pro His 
130 135 140 

Ser Leu Arg Tyr Phe Val Thr Ala Val Ser Arg Pro Gly Leu Gly Glu 
145 150 155 160 

Pro Arg Tyr Met Glu Val Gly Tyr Val Asp TVsp Thr Glu Phe Val Arg 
165 170 175 

Phe Asp Ser Asp Ala Glu Asn Pro Arg Tyr Glu Pro Arg Ala Arg Trp 
180 185 190 

Met Glu Gin Glu Gly Pro Glu Tyr Trp Glu Arg Glu Thr Gin Lys Ala 
195 200 205 

Lys Gly Asn Glu Gin Ser Phe Arg Val Asp Leu Arg Thr Leu Leu Gly 
210 215 220 

Tyr Tyr Asn Gin Ser Lys Gly Gly Ser His Thr He Gin Val He Ser 
225 230 235 240 

Gly Cys Glu Val Gly Ser Asp Gly Arg Leu Leu Arg Gly Tyr Gin Gin 
245 250 255 

Tyr Ala Tyr Asp Gly Cys Asp Tyr He Ala Leu Asn Glu Asp Leu Lys 
260 265 270 

Thr Trp Thr Ala Ala Asp Met Ala Ala Leu He Thr Lys His Lys Trp 
275 280 285 

Glu Gin Ala Gly Glu Ala Glu Arg Leu Arg Ala Tyr Leu Glu Gly Thr 
290 295 300 

Cys Val Glu Trp Leu Arg Arg Tyr Leu Lys Asn Gly Asn Ala Thr Leu 
305 310 315 320 

Leu Arg Thr Asp Ser Pro Lys Ala His Val Thr His His Ser Arg Pro 
325 330 335 

Glu Asp Lys Val Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala 
340 345 350 

Asp He Thr Leu Thr Trp Gin Leu Asn Gly Glu Glu Leu He Gin Asp 
355 360 365 



Met Glu Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gin Lys 
370 375 380 
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Trp Ala Ser Val Val Val Pro Leu Gly Lys Glu Gin Tyr Tyr Thr Cys 
385 390 395 400 

His Val Tyr His Gin Gly Leu Pro Glu Pro Leu Thr Leu Arg Trp Glu 
405 410 415 

Gly Gly He Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser He 
420 425 430 

Gin Lys Thr Pro Gin lie Gin Val Tyr Ser Arg His Pro Pro Glu Asn 
435 440 445 

Gly Lys Pro Asn He Leu Asn Cys Tyr Val Thr Gin Phe His Pro Pro 
450 455 450 

His He Glu He Gin Met Leu Lys Asn Gly Lys Lys He Pro Lys Val 
465 470 .475 480 

Glu Met Ser Asp Met Ser Phe Ser Lys Asp Trp Ser Phe Tyr He Leu 
485 490 ' 495 

Ala His Thr Glu Phe Thr Pro Thr Glu Thr Asp Thr Tyr Ala Cys Arg 
500 505 510 

Val Lys His Asp Ser Met Ala Glu Pro Lys Thr Val Tyr Trp Asp Arg 
515 520 525 

Asp Met Glu Gin Lys Leu He Ser Glu Glu Asp Leu 
530 535 540 



<210> 23 
<211> 1640 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 23 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag atctatttat agatattatg 120 
gtttgcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggagt 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagct tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtggtgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
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ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcggaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccaccggaga 1080 
atgggaagcc gaacatactg aactgctacg taacacagtt ccacccgcct cacattgaaa 1140 
tccaaatgct gaagaacggg aaaaaaattc ctaaagtaga gatgtcagat atgtccttca 1200 
gcaaggactg gtctttctat atcctggctc acactgaatt cacccccact gagactgata 1260 
catacgcctg cagagttaag catgacagta tggccgagcc caagaccgtc tactgggatc 1320 
gagacatgga acaaaagctt atttctgaag aagacttgca tatgcaggaa ctgacaacta 1380 
tatgcgagca aatcccctca ccaactttag aatcgacgcc gtactctttg tcaacgacta 1440 
ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa tcagtaacgt 1500 
ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc ataaacacac 1560 
agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac gacgttccag 1620 
actacgctta gtaactcgag 1640 

<210> 24 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 24 

lie Leu Leu His Thr Phe Ser lie Lys Met Gin Leu Leu Arg Cys Phe 
15 10 15 

Ser lie Phe Ser Val lie Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 
20 25 30 

Arg Ser He Tyr Arg Tyr Tyr Gly Leu Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp A1& Glu Asn Pro 
85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
100. 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 175 
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lie Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu lie Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp lie Thr Leu Thr Trp Gin Leu 
260 265 270 

Asn Gly Glu Glu Leu lie Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly lie Gly Ser Gly Gly Gly 
325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 
340 345 350 

Tyr ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 

Lys Asp Trp ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 
' 405 410 

Glu Thr ASP Thr Tyr Ala cys Arg val Lys His Asp Ser Met Ala Glu 
420 425 430 

pro Lys Thr val Tyr Trp ASP Arg Asp Met Glu Gin Lys Leu He Ser 
435 440 445 

Glu Glu ASP Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 



pro Ser pro THr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 



470 475 480 
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lie Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 
485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 
500 505 510 

Lys Gly Ser Pro lie Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 

Thr lie Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 25 
<211> 1640 
<212> msA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 25 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt ISO 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggagt 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagct tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtggtgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcggaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccaccggaga 1080 
atgggaagcc gaacatactg aactgctacg taacacagtt ccacccgcct cacattgaaa 1140 
tccaaatgct gaagaacggg aaaaaaattc ctaaagtaga gatgtcagat atgtccttca 1200 
gcaaggactg gtctttctat atcctggctc acactgaatt cacccccact gagactgata 1260 
catacgcctg cagagttaag catgacagta tggccgagcc caagaccgtc tactgggatc 1320 
gagacatgga acaaaagctt atttctgaag aagacttgca tatgcaggaa ctgacaacta 1380 
tatgcgagca aatcccctca ccaactttag aatcgacgcc gtactctttg tcaacgacta 1440 
ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa tcagtaacgt 1500 
ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc ataaacacac 1560 
agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac gacgttccag 1620 
actacgctta gtaactcgag 1640 

<210> 26 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 26 

He I.eu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 10 15 

Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 
20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

val ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

val ASP Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 
85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
XOO 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
lis 120 125 

val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 

'Ara Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 175 

He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Tip Leu Arg Arg Tyr 
210 215 220 



Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro' Lys Ala 
225 



230 235 240 



His val Thr His His ser Arg pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 
260 265 270 
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Asn Gly Glu Glu Leu lie Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 
325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 
340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn lie Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 
405 4X0 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 
420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 
485 490 \ 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 
500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
5X5 520 525 



Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 27 
<211> 1640 
<212> DKA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 27 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag gagcatcatc aattttgaaa 120 
agcttcttaa gggtggacca ggtggaggtt oaggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag oaggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cotgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattao atogccotga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcaco aaacacaagt 600 
gggagcaggo tggtgaagca gagagaotoa gggcctacct ggagggcacg tgcgtggagt 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagot tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtStgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcigaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccacoggaga 1080 
Itgggaagcc gaacatactg aactgctacg taacacagtt ccacccgcct cacattgaaa 1140. 
tccSatlct iaagaacggg aaaaaaattc ctaaagtaga gatgtcagat atgtccttca 1200 
gcaaggaotg gtotttctat atcctggctc aoactgaatt cacccccact gagactgata 1260 
cateScctg cagagttaag catgacagta tggccgagcc caagaccgtc tactgggatc 1320 
gScItggI acLIagct? atttctgaag aagacttgca tatgcaggaa ctgacaacta 1380 
SScgaIca aatcccctca ccaactttag aatcgacgcc gtactctttg tcaacgacta 1440 
cSttttggc caacgggaag gcaatgcaag gagtttttga atattacaaa tcagtaacgt 1500 
ttrtcagS^ ttgc^tc? cacccctcaa caactagcaa aggcagcccc ataaacacac 1560 
agStg?^ talgjcaat agctcgacga ttgaaggtag atacocatac gacgttccag 1620 
actacgctta gtaactcgag 

<210> 28 
<211> 541 
<212> PRT 

<213> Artificial Sequence 

<220> ^ 4.4^ 

<223> Description of Artificial sequence: Synthetic 

peptide 

Ile°Leu^Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 10 

ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 
20 25 30 

Arg ser He He Asn Phe Glu Lys Leu Leu Lys Oly Gly Pro Gly Gly 
35 40 45 

Gly ser Oly Gly oly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 
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Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

Val Asp Asp Tbr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 
85 90 95 

A3:g Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
100 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
X15 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr lie Gin Val lie Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 . 175 

lie Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu lie Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp lie Thr Leu Thr Trp Gin Leu 
260 265 270 

Asn Gly Glu Glu Leu lie Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 
325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 
340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He hen Asn Cys 
355 360 365 
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Tyr Val Thr Gin Phe His Pro Pro His lie Glu lie Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Glu 
405 410 415 

Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu Pro 
420 425 430 

Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser Glu 
435 440 445 

Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He Pro 
450 455 460 

Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr He 
465 470 475 • 480 

Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys Ser 
485 490 495 

Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys 
500 505 510 

Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr 
515 520 525 

He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 29 
<21X> 1631 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

•<400> 29 

atgcagttac ttcgctgttt ttcaatattt tctgttattg cttcagtttt agcacaggaa 60 
ctgacaacta tatgogagca aatcccctca ccaactttag aatcgacgcc gtactctttg 120 
tcaacgacta ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa 180 
tcagtaacgt ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc 240 
ataaacacac agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac 300 
gacgttccag actacgctct gcaggctagt ggtggtggtg gttcggtggt ggtggttctg 360 
gtggtggtgg ttctgctagc ggtggactta agggtggacc aggtggaggt tcaggaggtg 420 
gaggcccaca ctcgatgcgg tatttcgaga ccgcggtgtc ccggcgcggc ctcggggagc 480 
cccggtacat ctctgtcggc tatgtgaacg acaaggagtt cgtgcgcttc gacagcgacg 540 
cggagaatcc gagatatgag ccgagggcgc cgtggatgga gcaggagggg ccggagtatt 600 
gggagcggat cacgcagatc gccaagggcc aggagcagtg gttccgagtg aacctgagga 660 
ccctgctcgg ctactacaac cagagcgcgg gcggcactca cacactccag tggatgtacg 720 
gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta cgagcagttc gcctacgacg 780 
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gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacgttcgcg gacatgtcgt 840 
cgatgatcac ccgacgcaag tgggagcagg ctggtgctgc agagtattac agggcctacc 900 
tggagggcga gtgcgtggag tggctccaca gatacctgaa gaacgggaat gctacgctgc 960 
tgcgcacaga ttccccaaag gcacatgtga cctatcaccc cagatctaaa ggtgaagtca 1020 
ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc tggcagttga 1080 
atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca ggggatggaa 1140 
ccttccagaa gtgggcatct gtggtggtgc ctcttgggaa ggagcagaat tacacatgcc 1200 
gtgtgtacca tgaggggctg ccccatcccc tcaccctgag atgggagggt ggaataggtt 1260 
caggtggcgg tggatcagga ggcggaggtt caatccagaa aacccctcaa attcaagtat 1320 
actcacgcca cccaccggag aatgggaagc cgaacatact gaactgctac gtaacacagt 1380 
tccacccgcc tcacattgaa atccaaatgc tgaagaacgg gaaaaaaatt cctaaagtag 1440 
agatgtcaga tatgtccttc agcaaggact ggtctttcta tatcctggct cacactgaat 1500 
tcacccccac tgagactgat acatacgcct gcagagttaa gcatgacagt atggccgagc 1560 
ccaagaccgt ctactgggat cgagacatgg aacaaaagtit tatttctgaa gaagacttgt 1620 
aatagctcga g 1631 

<210> 30 
<211> 540 
<212> PRT 

<213> Artificial Seguence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 30 

Met Gin Leu Leu Arg Cys Phe Ser lie Phe Ser Val lie Ala Ser Val 
15 10 15 

Leu Ala Gin Glu Leu Thr Thr lie Cys Glu Gin lie Pro Ser Pro Thr 
20 25 30 

Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr lie Leu Ala Asn 
35 40 45 

Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys Ser Val Thr Phe 
50 55 60 

Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys Gly Ser Pro 
65 70 75 80 

lie Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr lie Glu Gly 
85 90 95 

Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Leu Gin Ala Ser Gly Gly 
100 105 110 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala -Ser Gly 
115 120 125 

Gly Leu Lys Gly Gly Pro Gly Gly Gly Ser Gly Gly Gly Gly Pro His 
130 135 140 

Ser Met Arg Tyr Phe Glu Thr Ala Val Ser Arg Arg Gly Leu Gly Glu 
145 150 155 160 
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Pro Arg Tyr lie Ser Val Gly Tyr Val Asn Asp Lys Glu Phe Val Arg 
165 170 175 

Phe Asp Ser Asp Ala Glu T^n Pro Arg Tyr Glu Pro Arg Ala Pro Trp 
180 185 190 

Met Glu Gin Glu Gly Pro Glu Tyr Trp Glu Arg lie Thr Gin lie Ala 

195 200 205 

Lys Gly Gin Glu Gin Trp Phe Arg Val Asn Leu Arg Thr Leu Leu Gly 
210 215 220 

Tyr Tyr Asn Gin Ser Ala Gly Gly Thr His Thr Leu Gin Trp Met Tyr 
225 230 235 240 

Gly Cys Asp Val Gly Ser Asp Gly Arg Leu Leu Arg Gly Tyr Glu Gin 
245 250 255 

Phe Ala Tyr Asp Gly Cys Asp Tyr He Ala Leu Asn Glu Asp Leu Lys 
260 265 270 

Thr Trp Thr Phe Ala Asp Met Ser Ser Met He Thr Arg Arg Lys Trp 
275 280 285 

Glu Gin Ala Gly Ala Ala Glu Tyr Tyr Arg Ala Tyr Leu Glu Gly Glu 
290 295 300 

Cys Val Glu Trp Leu His Arg Tyr Leu Lys Asn Gly Asn Ala Thr Leu 
305 310 315 320 

Leu Arg Thr Asp Ser Pro Lys Ala His Val Thr Tyr His Pro Arg Ser 
325 330 335 

Lys Gly Glu Val Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala 
340 345 350 

Asp He Thr Leu Thr Trp Gin Leu Asn Gly Glu Glu Leu Thr Gin Asp 
355 360 365 

Met Glu Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gin Lys 

370 375 380 

Trp Ala Ser Val Val Val Pro Leu Gly Lys Glu Gin Asn Tyr Thr Cys 
385 390 395 400 

Arg Val Tyr His Glu Gly Leu Pro His Pro Leu Thr Leu Arg Trp Glu 
405 410 415 

Gly Gly He Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser He 
420 425 430 

Gin Lys Thr Pro Gin He Gin Val Tyr Ser Arg His Pro Pro Glu Asn 
435 440 445 



Gly Lys Pro Asn He Leu Asn Cys Tyr Val Thr Gin Phe His Pro Pro 
450 455 460 
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His lie Glu lie Gin Met Leu Lys Asn Gly Lys Lys lie Pro Lys Val 
465 470 475 480 

Glu Met Ser Asp Met Ser Phe Ser Lys Asp Trp Ser Phe Tyr lie Leu 
485 490 495 

Ala His Thr Glu Phe Thr Pro Thr Glu Thr Asp Thr Tyr Ala Cys Arg 
500 505 510 

Val Lys His Asp Ser Met Ala Glu Pro Lys Thr Val Tyr Trp Asp Arg 
515 520 525 

Asp Met Glu Gin Lys h&n lie Ser Glu Glu Asp Leu 
530 535 540 



<210> 31 
<211> 832 
<212> DNA 

<213> Artificial Sequence 
<:220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 31 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggaga 660. 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ct 832 

<210> 32 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 32 

He Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 10 15 

Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 
20 25 30 
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Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 
85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
100 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 175 

He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leii Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr lieu Thr Trp Gin Leu 
260 265 270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly tbx Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 
325 330 335 
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Gly Ser Gly Gly Gly Gly Ser lie Gin Lys Thr Pro Gin He Gin Val 
340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Asn He Leu Ala His Thr Glu Phe Thr Pro Thr 
405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 
420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Arg Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 
485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 
500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 



Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 33 . 
<211> 720 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Synthetic 
nucleotide 

<400> 33 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
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agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggagc 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tcdccaaagg 720 

<210> 34 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description . of Artificial Sequence: Synthetic 
peptide 

<400> 34 

He Leu Leu His Tlir Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
15 10 15 

Ser He Phe Ser Val lie Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 
20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 
85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
100 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Qly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 X75 

He Ala Leu Asn Glu Asp Leu Lys Thr Txp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

lieu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 2X5 220 
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Leu Lys Aan Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 
260 265 270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 
325 330 • 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 
340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 
405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 
420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

Pro Ser Pro Thr Leu Glu ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 
485 . 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 
500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 
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Thr lie Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 

<210> 35 
<211> 1091 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 35 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttca acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggaga 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780- 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagct tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtggtgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcggaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccaccggaga 1080 
atgggaagcc g 1091 

<210> 36 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 36 

He Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
15 10 15 

Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu J^p Lys 
20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 SB 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 
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Val Asp Asp Thr Glu Phe Val Arg Phe Asn Ser Asp Ala Glu TVsn Pro 
85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 
100 105 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr lie Gin Val lie Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 
165 170 175 

lie Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 
180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 
245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 
260 265 270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 
325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val* 
340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 
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Asn Gly Lys Lys lie Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Asn He Leu Ala His Thr Glu Phe Thr Pro Thr 
405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 
420 425 430 

pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Arg Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 
485 490 .495 

ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 
500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 

Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 37 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetxc 
peptide 

<400> 37 

His Tyr Ser Pro Phe Arg Gin Leu Ala 
1 5 
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